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NOVEL PROTEINS AND NUCLEIC ACIDS ENCODING SAME 

BACKGROUND OF THE INVENTION 

The invention generally relates to nucleic acids and polypeptides. More particularly, the 
invention relates to nucleic acids encoding novel molecules (MOL) polypeptides, as well as 
5 vectors, host cells, antibodies, and recombinant methods for producing these nucleic acids and 
polypeptides. 

SUMMARY OF THE INVENTION 

The invention is based in part upon the discovery of nucleic acid sequences encoding 
novel polypeptides. The novel nucleic acids and polypeptides are referred to herein as MOLX, 

10 or MOL1, MOL2, MOL3, MOL4, MOL5, MOL6, MOL7, MOL8, MOL9, and MOL10 nucleic 
acids and polypeptides. These nucleic acids and polypeptides, as well as derivatives, homologs, 
analogs and fragments thereof, will hereinafter be collectively designated as "MOLX" nucleic 
acid or polypeptide sequences. 

In one aspect, the invention provides an isolated MOLX nucleic acid molecule encoding 

15 a MOLX polypeptide that includes a nucleic acid sequence that has identity to the nucleic acids 
disclosed in SEQ ID NOS:l, 3, 5, 7, 9, 1 1, 13, 15, 17, 19, 21, 23, 25, 27, and 29. In some 
embodiments, the MOLX nucleic acid molecule will hybridize under stringent conditions to a 
nucleic acid sequence complementary to a nucleic acid molecule that includes a protein-coding 
sequence of a MOLX nucleic acid sequence. The invention also includes an isolated nucleic acid 

20 that encodes a MOLX polypeptide, or a fragment, homolog, analog or derivative thereof. For 
example, the nucleic acid can encode a polypeptide at least 80% identical to a polypeptide 
comprising the amino acid sequences of SEQ ID NOS:2, 4, 6, 8, 10, 12, 14, 16, 18, 20, 22, 24, 
26, 28, and 30. The nucleic acid can be, for example, a genomic DNA fragment or a cDNA 
molecule that includes the nucleic acid sequence of any of SEQ ID NOS: 1 , 3, 5, 7, 9, 1 1 , 13, 1 5, 

25 17, 19, 21, 23, 25, 27, and 29. 

Also included in the invention is an oligonucleotide, e.g., an oligonucleotide which 
includes at least 6 contiguous nucleotides of a MOLX nucleic acid (e.g., SEQ ID NOS:l, 3, 5, 7, 
9, 1 1, 13, 15, 17, 19, 21, 23, 25, 27, and 29) or a complement of said oligonucleotide. 

Also included in the invention are substantially purified MOLX polypeptides (SEQ ID 

30 NOS:2, 4, 6, 8, 10, 12, 14, 16, 18, 20, 22, 24, 26, 28, and 30). In certain embodiments, the 
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MOLX polypeptides include an amino acid sequence that is substantially identical to the amino 
acid sequence of a human MOLX polypeptide. 

The invention also features antibodies that immunoselectively bind to MOLX 
polypeptides, or fragments, homologs, analogs or derivatives thereof. 
5 In another aspect, the invention includes pharmaceutical compositions that include 

therapeutically- or prophylactically-effective amounts of a therapeutic and a pharmaceutically- 
acceptable carrier. The therapeutic can be, e.g., a MOLX nucleic acid, a MOLX polypeptide, or 
an antibody specific for a MOLX polypeptide. In a further aspect, the invention includes, in one 
or more containers, a therapeutically- or prophylactically-effective amount of this 
10 pharmaceutical composition. 

In a further aspect, the invention includes a method of producing a polypeptide by 
culturing a cell that includes a MOLX nucleic acid, under conditions allowing for expression of 
the MOLX polypeptide encoded by the DNA. If desired, the MOLX polypeptide can then be 
recovered. 

15 In another aspect, the invention includes a method of detecting the presence of a MOLX 

polypeptide in a sample. In the method, a sample is contacted with a compound that selectively 
binds to the polypeptide under conditions allowing for formation of a complex between the 
polypeptide and the compound. The complex is detected, if present, thereby identifying the 
MOLX polypeptide within the sample. 

20 The invention also includes methods to identify specific cell or tissue types based on their • 

expression of a MOLX. 

Also included in the invention is a method of detecting the presence of a MOLX nucleic 
acid molecule in a sample by contacting the sample with a MOLX nucleic acid probe or primer, 
and detecting whether the nucleic acid probe or primer bound to a MOLX nucleic acid molecule 

25 in the sample. 

In a further aspect, the invention provides a method for modulating the activity of a 
MOLX polypeptide by contacting a cell sample that includes the MOLX polypeptide with a 
compound that binds to the MOLX polypeptide in an amount sufficient to modulate the activity 
of said polypeptide. The compound can be, e,g., a small molecule, such as a nucleic acid, 
30 peptide, polypeptide, peptidomimetic, carbohydrate, lipid or other organic (carbon containing) or 
inorganic molecule, as further described herein. 

Also within the scope of the invention is the use of a therapeutic in the manufacture of a 
medicament for treating or preventing disorders or syndromes including, e.g., diabetes, 
metabolic disturbances associated with obesity, the metabolic syndrome X, anorexia, wasting 
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disorders associated with chronic diseases, metabolic disorders, diabetes, obesity, infectious 
disease, anorexia, cancer-associated cachexia, cancer, neurodegenerative disorders, Alzheimer's 
Disease, Parkinson's Disorder, immune disorders, and hematopoietic disorders, or other 
disorders related to cell signal processing and metabolic pathway modulation. The therapeutic 
5 can be, e.g., a MOLX nucleic acid, a MOLX polypeptide, or a MOLX-specific antibody, or 
biologically-active derivatives or fragments thereof. 

For example, the compositions of the present invention will have efficacy for treatment 
of patients suffering from: Cancer including pancreatic cancer, adenoma, brain tumor, colon 
cancer breast cancer, prostate cancer, testis cancer, neurological disorders including age-related 

10 disorders, Alzheimer's disease, Stroke, Parkinson's disease, Huntington's disease, Cerebral palsy, 
Epilepsy, Behavioral disorders, Addiction, Anxiety, Pain, nephropathy, neurodegenerative 
disorders, Aneurysms, Fibromuscular dysplasia, metabolic disorders including, failure to thrive, 
nutritional edema, hypoproteinemia, tripsinogen deficiency disease, chronic and heriditary 
pancreatitis, enterkinase defieciency, Hypercholesterolemia, Obesity, Diabetes, cardiac disorders 

1 5 inclusing tachycardia, erythroderma, long QT syndrome, heart block, Ataxia-telangiectasia, 

Cardiomyopathy, Atherosclerosis, Hypertension, Congenital heart defects, Aortic stenosis, Atrial 
septal defect (ASD), Atrioventricular (A-V) canal defect, Ductus arteriosus, Pulmonary stenosis, 
Subaortic stenosis, Ventricular septal defect (VSD), valve diseases, Larsen syndrome, night 
blindness, brugada syndrome, Von Hippel-Lindau (VHL) syndrome, Tuberous sclerosis, 

20 Hypercalcemia, Cirrhosis, angiogenesis and wound healing, blood pressure regulation, Trauma, 
Tuberous sclerosis, Fertility, Hirschsprung's disease, Renal artery stenosis, Interstitial nephritis, 
Glomerulonephritis, Polycystic kidney disease, Renal tubular acidosis, IgA nephropathy, 
immune disorders including cell-mediated immunity disorders, Leukodystrophies, inflammation, 
Hyperthyroidism, Hypothyroidism, Lesch-Nyhan syndrome, Multiple sclerosis, Transplantation, 

25 lung diseases, including asthma, Emphysema, immundeficiencies, Crohn's disease, Scleroderma, 
Appendicitis, Autoimmune diseases, Systemic lupus erythematosus, developmental disorders, 
neural tube defects, modulation of apoptosis, viral, bacterial, and parasitic infections and/or other 
pathologies and disorders of the like. 

The polypeptides can be used as immunogens to produce antibodies specific for the 

30 invention, and as vaccines. They can also be used to screen for potential agonist and antagonist 
compounds. For example, a cDNA encoding MOLX may be useful in gene therapy, and MOLX 
may be useful when administered to a subject in need thereof. By way of nordimiting example, 
the compositions of the present invention will have efficacy for treatment of patients suffering 
from Cancer including pancreatic cancer, adenoma, brain tumor, colon cancer breast cancer, 
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prostate cancer, testis cancer, neurological disorders including age-related disorders, Alzheimer's 
disease, Stroke, Parkinson's disease, Huntington's disease, Cerebral palsy, Epilepsy, Behavioral 
disorders, Addiction, Anxiety, Pain, nephropathy, neurodegenerative disorders, Aneurysms, 
Fibromuscular dysplasia, metabolic disorders including, failure to thrive, nutritional edema, 
5 hypoproteinemia, trypsinogen deficiency disease, chronic and heriditary pancreatitis, enterkinase 
defieciency, Hypercholesterolemia, Obesity, Diabetes, cardiac disorders inclusing tachycardia, 
erythroderma, long QT syndrome, heart block, Ataxia-telangiectasia, Cardiomyopathy, 
Atherosclerosis, Hypertension, Congenital heart defects, Aortic stenosis, Atrial septal defect 
(ASD), Atrioventricular (A-V) canal defect, Ductus arteriosus, Pulmonary stenosis, Subaortic 

10 stenosis, Ventricular septal defect (VSD), valve diseases, Larsen syndrome, night blindness, 
brugada syndrome, Von Hippel-Lindau (VHL) syndrome, Tuberous sclerosis, Hypercalceimia, 
Cirrhosis, angiogenesis and wound healing, blood pressure regulation, Trauma, Tuberous 
sclerosis, Fertility, Hirschsprung's disease, Renal artery stenosis, Interstitial nephritis, 
Glomerulonephritis, Polycystic kidney disease, Renal tubular acidosis, IgA nephropathy, 

15 immune disorders including cell-mediated immunity disorders, Leukodystrophies, inflammation, 
Hyperthyroidism, Hypothyroidism, Lesch-Nyhan syndrome, Multiple sclerosis, Transplantation, 
lung diseases, including asthma, Emphysema, immundeficiencies, Crohn's disease, Scleroderma, 
Appendicitis, Autoimmune diseases, Systemic lupus erythematosus, developmental disorders, 
neural tube defects, modulation of apoptosis, viral, bacterial, and parasitic infections and/or other 

20 pathologies and disorders. 

The invention further includes a method for screening for a modulator of disorders or 
syndromes including, e.g., Cancer including pancreatic cancer, adenoma, brain tumor, colon 
cancer breast cancer, prostate cancer, testis cancer, neurological disorders including age-related 
disorders, Alzheimer's disease, Stroke, Parkinson's disease, Huntington's disease, Cerebral palsy, 

25 Epilepsy, Behavioral disorders, Addiction, Anxiety, Pain, nephropathy, neurodegenerative 

disorders, Aneurysms, Fibromuscular dysplasia, metabolic disorders including, failure to thrive, 
nutritional edema, hypoproteinemia, trypsinogen deficiency disease, chronic and heriditary 
pancreatitis, enterkinase defieciency, Hypercholesterolemia, Obesity, Diabetes, cardiac disorders 
inclusing tachycardia, erythroderma, long QT syndrome, heart block, Ataxia-telangiectasia, 

30 Cardiomyopathy, Atherosclerosis, Hypertension, Congenital heart defects, Aortic stenosis, Atrial 
septal defect (ASD), Atrioventricular (A-V) canal defect, Ductus arteriosus, Pulmonary stenosis, 
Subaortic stenosis, Ventricular septal defect (VSD), valve diseases, Larsen syndrome, nigjit 
blindness, brugada syndrome, Von Hippel-Lindau (VHL) syndrome, Tuberous sclerosis, 
Hypercalceimia, Cirrhosis, angiogenesis and wound healing, blood pressure regulation, Trauma, 
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Tuberous sclerosis, Fertility, Hirschsprung's disease, Renal artery stenosis, Interstitial nephritis, 
Glomerulonephritis, Polycystic kidney disease, Renal tubular acidosis, IgA nephropathy, 
immune disorders including cell-mediated immunity disorders, Leukodystrophies, inflammation, 
Hyperthyroidism, Hypothyroidism, Lesch-Nyhan syndrome, Multiple sclerosis, Transplantation, 
5 lung diseases, including asthma, Emphysema, irnmundeficiencies, Crohn's disease, Scleroderma, 
Appendicitis, Autoimmune diseases, Systemic lupus erythematosus, developmental disorders, 
neural tube defects, modulation of apoptosis, viral, bacterial, and parasitic infections or other 
disorders related to cell signal processing and metabolic pathway modulation. The method 
includes contacting a test compound with a MOLX polypeptide and determining if the test 

10 compound binds to said MOLX polypeptide. Binding of the test compound to the MOLX 

polypeptide indicates the test compound is a modulator of activity, or of latency or predisposition 
to the aforementioned disorders or syndromes. 

Also within the scope of the invention is a method for screening for a modulator of 
activity, or of latency or predisposition to an disorders or syndromes including, e.g., Cancer 

1 5 including pancreatic cancer, adenoma, brain tumor, colon cancer breast cancer, prostate cancer, 
testis cancer, neurological disorders including age-related disorders, Alzheimer's disease, Stroke, 
Parkinson's disease, Huntington's disease, Cerebral palsy, Epilepsy, Behavioral disorders, 
Addiction, Anxiety, Pain, nephropathy, neurodegenerative disorders, Aneurysms, Fibromuscular 
dysplasia, metabolic disorders including, failure to thrive, nutritional edema, hypoproteinemia, 

20 trypsinogen deficiency disease, chronic and heriditary pancreatitis, enterkinase defieciency, 

Hypercholesterolemia, Obesity, Diabetes, cardiac disorders inclusing tachycardia, erythroderma, 
long QT syndrome, heart block, Ataxia-telangiectasia, Cardiomyopathy, Atherosclerosis, 
Hypertension, Congenital heart defects, Aortic stenosis, Atrial septal defect (ASD), 
Atrioventricular (A-V) canal defect, Ductus arteriosus, Pulmonary stenosis, Subaortic stenosis, 

25 Ventricular septal defect (VSD), valve diseases, Larsen syndrome, night blindness, brugada 

syndrome, Von Hippel-Lindau (VHL) syndrome, Tuberous sclerosis, Hypercalcemia, Cirrhosis, 
angiogenesis and wound healing, blood pressure regulation, Trauma, Tuberous sclerosis, 
Fertility, Hirschsprung's disease, Renal artery stenosis, Interstitial nephritis, Glomerulonephritis, 
Polycystic kidney disease, Renal tubular acidosis, IgA nephropathy, immune disorders including 

30 cell-mediated immunity disorders, Leukodystrophies, inflammation, Hyperthyroidism, 

Hypothyroidism, Lesch-Nyhan syndrome, Multiple sclerosis, Transplantation, lung diseases, 
including asthma, Emphysema, irnmundeficiencies, Crohn's disease, Scleroderma, Appendicitis, 
Autoimmune diseases, Systemic lupus erythematosus, developmental disorders, neural tube 
defects, modulation of apoptosis, viral, bacterial, and parasitic infections or other disorders 
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related to cell signal processing and metabolic pathway modulation by administering a test 
compound to a test animal at increased risk for the aforementioned disorders or syndromes. The 
test animal expresses a recombinant polypeptide encoded by a MOLX nucleic acid Expression 
or activity of MOLX polypeptide is then measured in the test animal, as is expression or activity 
5 of the protein in a control animal which recombinantly-expresses MOLX polypeptide and is not 
at increased risk for the disorder or syndrome. Next, the expression of MOLX polypeptide in 
both the test animal and the control animal is compared. A change in the activity of MOLX 
polypeptide in the test animal relative to the control animal indicates the test compound is a 
modulator of latency of the disorder or syndrome. 

10 In yet another aspect, the invention includes a method for determining the presence of or 

predisposition to a disease associated with altered levels of a MOLX polypeptide, a MOLX 
nucleic acid, or both, in a subject (e.g., a human subject). The method includes measuring the 
amount of the MOLX polypeptide in a test sample from the subject and comparing the amount of 
the polypeptide in the test sample to the amount of the MOLX polypeptide present in a control 

15 sample. An alteration in the level of the MOLX polypeptide in the test sample as compared to 
the control sample indicates the presence of or predisposition to a disease in the subject 
Preferably, the predisposition includes, e.g., diabetes, metabolic disturbances associated with 
obesity, the metabolic syndrome X, anorexia, wasting disorders associated with chronic diseases, 
metabolic disorders, diabetes, obesity, infectious disease, anorexia, cancer-associated cachexia, 

20 cancer, neurodegenerative disorders, Alzheimer's Disease, Parkinson's Disorder, immune 

disorders, and hematopoietic disorders. Also, the expression levels of the new polypeptides of 
the invention can be used in a method to screen for various cancers as well as to determine the 
stage of cancers. 

In a further aspect, the invention includes a method of treating or preventing a 
25 pathological condition associated with a disorder in a mammal by administering to the subject a 
MOLX polypeptide, a MOLX nucleic acid, or a MOLX-specific antibody to a subject (e.g., a 
human subject), in an amount sufficient to alleviate or prevent the pathological condition. In 
preferred embodiments, the disorder, includes, e.g., Cancer including pancreatic cancer, 
adenoma, brain tumor, colon cancer breast cancer, prostate cancer, testis cancer, neurological 
30 disorders including age-related disorders, Alzheimer's disease, Stroke, Parkinson's disease, 

Huntington's disease, Cerebral palsy, Epilepsy, Behavioral disorders, Addiction, Anxiety, Pain, 
nephropathy, neurodegenerative disorders, Aneurysms, Fibromuscular dysplasia, metabolic 
disorders including, failure to thrive, nutritional edema, hypoproteinemia, tripsinogen deficiency 
disease, chronic and heriditary pancreatitis, enterkinase defieciency, Hypercholesterolemia, 
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Obesity, Diabetes, cardiac disorders inclusing tachycardia, erythroderma, long QT syndrome, 
heart block, Ataxia-telangiectasia, Cardiomyopathy, Atherosclerosis, Hypertension, Congenital 
heart defects, Aortic stenosis, Atrial septal defect (ASD), Atrioventricular (A-V) canal defect, 
Ductus arteriosus, Pulmonary stenosis, Subaortic stenosis, Ventricular septal defect (VSD), valve 
5 diseases, Larsen syndrome, night blindness, brugada syndrome, Von Hippel-Lindau (VHL) 
syndrome, Tuberous sclerosis, Hypercalceimia, Cirrhosis, angiogenesis and wound healing, 
blood pressure regulation, Trauma, Tuberous sclerosis, Fertility, Hirschsprung's disease, Renal 
artery stenosis, Interstitial nephritis, Glomerulonephritis, Polycystic kidney disease, Renal 
tubular acidosis, IgA nephropathy, immune disorders including cell-mediated immunity 

10 disorders, Leukodystrophies, inflammation, Hyperthyroidism, Hypothyroidism, Lesch-Nyhan 
syndrome, Multiple sclerosis, Transplantation, lung diseases, including asthma, Emphysema, 
irmnundeficiencies, Crohn's disease, Scleroderma, Appendicitis, Autoimmune diseases, Systemic 
lupus erythematosus, developmental disorders, neural tube defects, modulation of apoptosis, 
viral, bacterial, and parasitic infections. 

15 In yet another aspect, the invention can be used in a method to identity the cellular 

receptors and downstream effectors of the invention by any one of a number of techniques 
commonly employed in the art. These include but are not limited to the two-hybrid system, 
affinity purification, co-precipitation with antibodies or other specific-interacting molecules. 
Unless otherwise defined, all technical and scientific terms used herein have the same 

20 meaning as commonly understood by one of ordinary skill in the art to which this invention 

belongs. Although methods and materials similar or equivalent to those described herein can be 
used in the practice or testing of the present invention, suitable methods and materials are 
described below. All publications, patent applications, patents, and other references mentioned 
herein are incorporated by reference in their entirety. In the case of conflict, the present 

25 specification, including definitions, will control In addition, the materials, methods, and 
examples are illustrative only and not intended to be limiting. 

Other features and advantages of the invention will be apparent from the following 
detailed description and claims. 

DETAILED DESCRIPTION OF THE INVENTION 

30 The invention is based, in part, upon the discovery of novel nucleic acid sequences that 

encode novel polypeptides. The novel nucleic acids and their encoded polypeptides are referred 
to individually as MOL1, MOL2, MOL3, MOL4, MOL5, MOL6, MOL7, MOL8, MOL9, and 
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MOL10. The nucleic acids, and their encoded polypeptides, are collectively designated herein as 
"MOLX". 

The novel MOLX nucleic acids of the invention include the nucleic acids whose 
sequences are provided in Tables 1 A, 2A, 3A, 4A, 5A, 6A, 6D, 7A, 8A, 8D, 9A, 9D, 9F, and 
5 10A. inclusive ('Tables 1A - 10A"), or a fragment, derivative, analog or homolog thereof. The 
novel MOLX proteins of the invention include the protein fragments whose sequences are 
provided in Tables IB, 2B, 3B, 4B 5B, 6B, 6E, 7B, 8B, 8E, 9B, 9E, 9G and 10B inclusive 
("Tables IB - 10B"). The individual MOLX nucleic acids and proteins are described below. 
Within the scope of this invention is a method of using these nucleic acids and peptides in the 
1 0 treatment or prevention of a disorder related to cell signaling or metabolic pathway modulation. 



MOL1 

A disclosed interleukin-1 receptor/Toll-like nucleic acid of 1050 nucleotides, MOL1, is 
shown in Table 1 A. The disclosed MOL1 open reading frame ("ORF") begins at the ATG 
initiation codon at nucleotides 1-3, shown in bold in Table 1 A. The encoded polypeptide is 
15 alternatively referred to herein as MOL1 or as GMJ79960178. The disclosed MOL1 ORF 

terminates at a TGA codon at nucleotides 3043-3045. As shown in Table 1 A the start and stop 
codons are in bold letters. 



Table 1A. MOL1 nucleotide sequence (SEQ ED NO:l). 

ATGCTGGCCATGACCCTGGCCCTGGGTACCTTGCCTGCCTTCCTACCCTGTGAGCTCCAGCCCCACGGCCTGGTGAA 
CTGCAACTGGCTGTTCCTGAAGTCrcTGCCCCACTTCTCCATGGCAGCACCCCGTGGCAATGTCACCAGCCTTTCCT 
TGTCCTCCAACCGCATCCACCACCTCCATGATTCTGACTTrGCCCACCTGCCCAGCCTGCGGCATCTCAACCTCAAG 
TGGAACTGCCCGCCGGTTGGCCTCAGCCCCATGCACTTCCCCTGCC^CATGAC(^CGAGCCCAGCACCTTCTTGGC 
TGTGCCCACCCTGGAAGAGCTAARCCTG^.GCTACAACAACATCATGACTGTGCCTGCGCTGCCCAAATCCCTCATAT 
CCCTGTCCCTCAGCCATACCZAACATCCTGATGCTAGACTCTGCCAGCCTCGCCGGCCTGCATGCCCTGCGCrTCCTA 
TTCAT GGACGGCAACTGTT ATTACAAGAACCCCTGCAGGCAGGCACT GGAGGTGGCCCCGG GTGC CCTCCTTGGCCT 
GGGCAACCiraCCCACCTGTCACTCAAGTACAACAACC^ 

ATCTGCTGTTGTCCTACAACCGCATCGTCAAACTGGCGCCTGAGGACCTGGCCAATCTGACCGCCCTGCGT3TG 

GATGTGGGCGGAAATTGCCGCCGCTGCGACCA(^CTCC^ 

ACATCCCGATACCTTCAGCCACCTGAGCCGTC^ 

CCAGT TGGTTCCGTGGGCTGGGAAACCTCCGAGTGCT GGACCTGAGTGAGAACTTCCTCTAC AAAT GCATCACTAAA 
ACCAAGGCCTTCCAGGGCCTAACACAGCTGCG 

CCACCTGTCTCTGGCCCCTTCCTTCGGGAGCCTGGTCGCCCTGAAGGAGCTGGACATGCACGGCATCTTCTTCCGCT 

CACTCGATGAGACCACGCTCCGGCCACTGGCCCGCCTGCCCATGCTCCAGACTCTGCGTCTGCAGATGAACTTCATC 

AACCAGGCCCAGCTCGGCATCTTCAGGGCCTTCCCTGGCCTGCGCTACGTGGACCTGTCGGACAACCGCATCAGCGG 

AGCTTCGGAGCTGACAGCCAC(^TGGGGGAGGCAGATGGAGGGGAGAAGGTCTGGCTGCAGCCTGGGGACCTTGCTC 

CGGCCCCAGTGGACACTCCCAGCTCTGAAGACTTCAGGCCCAACTGCAGCACCCTCAACTTCACCTTGGATCTGT^ 

CGGAACAACCTGGTGACCGTGCAGCCGGAGATGTTTGCCCAGCTCTCGCACCTGCAGTGCCTGCGCCTGAGCCAa^ 

CTGCATCTCGCAGGCAGTCy^TGGCTCCCAGTTCCTGCC^ 

AGCTGGACCTCTACC^CGAGCACTCATTCACGGAGCT^ 

CCCTTTGGCATGCAGGGCGTGGGCCACAACTTCAGCTT 

CCACAACAACATCCACAGCCAAGTGTCCCAGCAGCT^ 

CACTGGGCCATATGTGGGCCGAGGGAGACCTCTATCTGCACT^ 

TTGTCCCAGAACCGCCTGC^CACCCTCCTGCCC^^ 

CCGTGACAAJ^ACCTGGCCTTCTTTAAGTGGTGGAGCCTCCACTTC^ 

GAAACCAGCTGAAGGCCCTGACCAATGGCAGCCTGCCTGCTGGC^^ 

AGCATCAGCTTCGTGGCCCCCGGCTTCTCTTCC^ 

CAAGACAGTGGACCACTCCTGGTTTGGGCCCCTGGCGAGTGCGCTGCAAATACTAGATGTAAGCGCCAACCCTCTGC 
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ACTGCGCCTGTGGGGCGGCCTrTATGGACTTCCTGCTGGAGGTGCAGGCTGCCGTGCCCGGTCTGCCCAGCCGGGTG 
AAGTGTGGCAGTCCGGGCCAGCTCCAGGGCCTCAGC^^ 

CTCCTGGGACrGTTTCGCCCTCTCGCTGCTGGCTGTGGCTCTGGGCCTGGGTGTGCCX^TGCTGCATCACCTCrrGTG 
GCTGGGACCTCTGGTACTGCTTCCACCTGTGCCTGGCCTGGCTTCCCTGGCGGGGGCGGCAAAGTGGGCGAGATGAG 
GATGCCCTGCCCTACGATGCCTTCGTGGTCTTCG^ 

TCGGGGGOU3CTGGAGGAGTGCCGTGGO:GCTGGGCACTCCGCCTGTGCC?GGAGGAACG^ 

AAACCCTCTTTGAGAACCTGTGGGCCTCGGTCTATGGCAGCCGCAAGACGCT 

GTCAGTGGTCTCTTGCGCGCCAGCTTCCn^GCT^ 

GGTGATCCTGAGCCCTGACGGCCGCCGCTCCCGCTATGTGCGGCTGCGCC^GCGCCTCT^CCGCCAGAGTGTCCTCC 

TCTGGCCCCACCAGCCCAGTGGTCAGCGCAGCTTCTGGGCCC^^ 

TTCTATAACCGGAACTTCTGCCAGGGACCCACGGCCGAATAG 



A disclosed encoded MOL1 protein has 346 amino acid residues, referred to as the 
MOL1 protein. The MOL1 protein was analyzed for signal peptide prediction and cellular 
localization. SignalP results predict that MOL1 is cleaved between position 41 and 42 of SEQ 
5 ID NO:2. Psort and Hydropathy profiles also predict that MOL1 contains a signal peptide and is 
likely to be localized in the plasma membrane (Certainty=0.4600). The disclosed MOL1 
polypeptide sequence is presented in Table IB using the one-letter amino acid code. 



Table IB. Encoded MOL1 protein sequence (SEQ ED NO:2). 

WN C P P VGLS PMH FPCHMT I E P ST FLAVPT1EELNLS YNNIMT VP Al PKSLISLSLS HTNI LMLDS AS LAGLHALRFL 

FMDGNCYYKNPCRQALEVAPGALLGLGNLTHLSL^ 

DVGGNCRRCDHAPNPCMECPRHFPQLHPOTFSHLSRLEGLVL^ 

TKAFQGLTQLRKLNLS FNYQKRVS FAHLSLAPS FGSLVALKELDMHGIFFRSLDETTLRPIJ^RLPMLQTLRLGMNFI 
NQAQLGI FRAFPGLRYVDLSDNRI SGASELTATMGEAIX3GEKVWLQPGDIAPAPVDTPSSEDFRPNCSTLNFTLDLS 
RNNLVTVQP3MF7VQLSHLQCLRLSHNCISQAVNGSQF^ 

PFGMQGVGHNFSFVAHLRTLRHLSLAHNNIHSQVSQQLCSTSLRALDFSGNALGHMWAEGDLYLH 
LSQNRLHTI^PQTIiRN&FKSLQVLRLRDNY^ 

S ISFVAPGFFSE^AKELRELNLSANALKTVDHSWFGPLASALQILDVSANPLHCACGAAFMDFLLEVQAAV 

KCGSPGQLQGLSIFAQDLRLCLDEALSWIX^FALSLLAVALGLGVPMLHHLCGWDLWYCF^ 

DALPYDAFWFD?CTQSAVADWVTNELRGQIiEECRGRWALRIjCLEERDWLPGKTLFENLWASVYGSRK^ 

VS GLLRAS FLLAQQRLLE DRKDVWLVI LS PDGRRSRYVRLRQRLCRQS VLLWPHQPSGQR S FWAQLGMALT RDNHH 

FYNRNFCQGPTAE 



10 MOL1 was initially identified on chromosome 3 with a TblastN analysis of a proprietary 

sequence file for a G-protein coupled receptor probe or homolog, which was run against the 
Genomic Daily Files made available by GenBank. A proprietary software program (GenScan™) 
was used to further predict the nucleic acid sequence and the selection of exons. The resulting 
sequences were further modified by means of similarities using BLAST searches. The 

1 5 sequences were then manually corrected for apparent inconsistencies, thereby obtaining the 
sequences encoding the full-length protein. 

A region of the MOL1 nucleic acid sequence has 690 of 1203 bases (57 %) identical to a 
Homo sapiens Toll Receptor mRNA (GENBANK-ID: AL137451), with an E-value of 5.7X10" 8 . 
In all BLAST alignments herein, the ''E-value" or "Expect" value is a numeric indication of the 

20 probability that the aligned sequences could have achieved their similarity to the BLAST query 

sequence by chance alone, within the database that was searched. For example, the probability 
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that the subject ("Sbjcf ') retrieved from the M0L1 BLAST analysis, e.g., the Homo sapiens 
MOL, matched the Query M0L1 sequence purely by chance is SJxlO" 8 . 

A BLASTX search was performed against public protein databases. The full amino acid 
sequence of the protein of the invention was found to have 342 of 900 amino acid residues (38%) 
5 identical to, and 493 of 900 residues (54%) positive with, the 1049 amino acid residue Toll-like 
Receptor 7 protein from Homo sapiens (ptnr:SPTREMBL-ACC AAJF60188) . 

The amino acid sequence of MOL1 also had high homology to other proteins as shown in 
table 1C. 



Table 1C. BLAST results for MOL1 


Gene Index/ 
Identifier 


Protein/ 
Organism 


Length 
(aa) 


Identity 
(%) 


Positives 
(%) 


Expect 


Patp:W86365 


DNAX toll-like 
receptor DTLR- 
10 [Homo 
sapiens] 


336 


335/336 
(99%) 


335/336 | 
(99%) 


0.0 


gi|8394456!ref |NP 
_059138.1| 


toll-like 
receptor 9 
[Homo sapiens] 


1032 


960/1014 
(94%) 


960/1014 
(94%) 


0.0 


gi| 13648665 |ref |X 
P_003236.2| 


toll-like 
receptor 9 
[Homo sapiens] 


1014 


960/1014 
(94%) 


960/1014 
(94%) 


0.0 


gi|8099654|gb|AAF 
72190. 1|AF259263 
1 


.toll-like 
receptor 9 form 
B [Homo 
sapiens] 


975 


921/975 
(94%) 


921/975 
(94%) 


0.0 


gi|13507173lgb|AA 
K28488.1IAF314224 
1 


toll-like 
receptor 9 [Mus 
mus cuius] 


1032 


720/1015 
(70%) 


799/1015 
(77%) 


0.0 



10 

A ClustalW analysis comparing disclosed proteins of the invention with related OR 
protein sequences is given in Table ID, with MOL1 shown on line 1 . 

In the ClustalW alignment of the MOL1 protein, as well as all other ClustalW analyses 
herein, the black outlined amino acid residues indicate regions of conserved sequence (z.e., 

15 regions that may be required to preserve structural or functional properties), whereas non- 
highlighted amino acid residues are less conserved and can potentially be mutated to a much 
broader extent without altering protein structure or function. Residue differences between any 
MOLX variant sequences herein are written to show the residue in the "a" variant and the residue 
position with respect to the "a" variant. MOL residues in all following sequence alignments that 

20 differ between the individual MOL variants are highlighted with a box and marked with the (o) 
symbol above the variant residue in all alignments herein. 
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Table ID. ClustalW Analysis of MOL1 

1) Novel HOL1 (SEQ ID NO: 2) 

2) gi|8394456|ref |NP_059138.1| toll-like receptor 9 [Homo sapiens] (SEQ ID 
NO: 31) 

5 3) gi|13648665|ref 1XP_003236.2| toll-like receptor 9 [Homo sapiens] (SEQ ID 

NO:32) 

4) gi|8099654|gb|AAF72190.1|AF259263_l toll-like receptor 9 form B [Homo 
sapiens] (SEQ ID NO: 33) 

5) gi 1 13507173 1 gblAAK28488.1|AF314224_l toll-like receptor 9 [Mus musculus] 
10 (SEQ ID NO: 34) 



15 



20 



25 



PROl 

NP_059138, 
XPJ)03236. 
AF259263 1 



10 



20 

• I ■ 



30 
- I - 



40 
• I 



50 
-I - 



60 
I 



AF314224_1 MVLRRRTLHP LSLLVQAAVjJ 



PROl 

NPJJ59138. 
XP_003236 . 
AF259263_1 
AF314224 1 




30 



35 



40 



45 



50 



55 



60 



65 



70 



75 



PROl 

NP_059138. 
XPJ303236. 
AF259263_1 
AF314224 1 



PROl 

NP_059138. 
XP_003236. 
AF259263_1 
AF314224 1 



PROl 

MP 059138. 
XPJ)03236. 
AF259263_1 
AF314224 1 



PROl 

MP 059138. 
XP~003236. 
AF259263_1 
AF314224~1 



PROl 

NP_059138 . 
XP 003236. 
AF259263_1 
AF314224 1 



PROl 

MP_059138. 
XP_O03236. 
AF259263_1 
AF314224 1 




.1. 



150 
..I 



160 
.1 . 



170 
..I • 



180 
■ I 



LPKSLISL3L 
LFKSLISL3L 
LFKSZiISLSL 
LPKSLISLSL 

lp3sl2Blsl 



190 
..I . 



200 




jpCRQALEV 
SpCRQALEV 

Secrqalev 
Spcrqalev 





210 


LTHLSLKY 




LTHLSLKY 




LTHLSLKY 




LTHLSLKi 




LTHLSLKY 


[nn 




270 





FAHLSLAPSF 
FAHL SLAPS F 
FAHLSLAPSF 
FAHLSLAPSF 
F7,|aT.!l|T,AESF 



342 
360 
342 
303 
360 




490 



500 



510 



520 



530 



540 
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10 



15 



20 



PROl 

NP_0S9138. 
XP_003236 . 
AF259263__1 
AF314224 1 



PROl 

NP_059138 . 
XPJ>03236 . 
W259263_l 
AF314224 1 





.1. 



630 

. . I * 



YLHFFQGLSG 
YLHFFQGLSG 
YLHFFQGLSG 
YLHFFQGLSG 
YLHFFQGLSG 




25 



30 



35 



40 



45 



G70 
..I . 



680 
. . I 



PROl 

NPJ>59138 . 
XPJ>03236. 
AF259263_1 
AF314224 1 



PROl 

NPJ>59138 . 
XP_003236. 
AF259263J. 
AP314224 1 



PROl 

NPJ>59138. 
XP_003236. 
AF259263_1 
AF314224 1 




1 690 


EVLDLAG 




EVLDLAG 


Soil 


EVLDLAG 




EVLDLAG 




EVLDLAG 


3ql| 


750 



700 
• I 



710 

• 1 - 



720 

-.1 




760 
■ I . 




.1 . 



770 
..I 



PLHCACGAA 
PLHCACGAA 
PLHCACGAA 
PLHCACGAA 
PLKCACGAA 



810 
. . I 



SIFAQDLRLC 
SIFAQDLRLC 
SIFAQDLRLC 
SIFAQDLRLC 
SIFAQDLRLC 





50 



55 



60 



65 



70 



75 



PROl 

NP_059138. 
XP_O03236. 
AF259263_1 
AP314224 1 



PROl 

NP 059138. 
XP_003236. 
AF259263_1 
AF314224 1 



PROl 

NP_059138. 
XPJ303236. 
AF259263_1 
AF314224 1 



PROl 

NP 059138. 



.1. 



850 



860 
..I . 



gwdlwycfhl 
gwdlv7ycfhl 
gwdlv7ycfhl 
gwdlwycfhl 
gwdEwycfhl 



870 

. I 



680 



■ I . 




QSAVADftVY 
QSAVADtfVY 
QSAVADWVY 
QSAVADttVY 
QSAVADKVY 



910 



920 
..I . 



930 




. I. 



940 

..I . 




3RKTLFVLAH 
3RKTLFVLAH 
3RKTLFVLAH 
SRKTLFVLAH 
SRKTLFVLAH 



970 
.1 . 



980 



ED3XDVVV1V 
EDRKDVVVLV 
EDRKDVVVLV 
EDRKDVVVLV 
EDRKDVVVLV 




990 
..I • 




SFLLAQQRLL 
SFLLAQQRLL 
SFLLAQQRLL 
SFLLAQQRLL 
SFLLAQQRLL 



941 

959 
941 
902 
959 



YVRLRQRLCF QSVLLWPnQF 
YVRLRQRLCH QSVLLWPHQP 
YVRLRQRLCB QSVLL7JPHQP 
YVRLRQRLCR QSVLLWPHQP 
YVRLRQRLCR QSVLpP^O? 
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XPJJ03236. ^nIn^^ jjjjj 1014 



5 

The interleukin-1 (EL-1) receptor/Toll-like receptor (TLR) superfamily is a recently 
defined and expanding group of receptors that participate in host responses to injury and 
infection. The superfamily is defined by the Toll/IL-1 receptor (TER) domain, which occurs in 
the cytosolic region of family members, and is further subdivided into two groups based on 

10 homology to either the Type I IL-1 receptor or Drosophila Toll receptor extracellular domain. 
The former group includes the receptor for the important Thl cytokine IL-1 8, and T1/ST2, 
which may have a role in Th2 cell function. The latter group includes six mammalian TLRs, 
including TLR2 and TLR4, that largely mediate the host response to gram-positive and gram- 
negative bacteria, respectively. Whether bacterial products are actual ligands for TLRs, or 

1 5 whether they generate ligands via as yet unidentified pattern recognition receptors, has yet to be 
determined. Signaling pathways activated via the TER domain trigger the activation of 
downstream kinases, and transcription factors such as NF-kappaB, and involve the adaptor 
protein MyD88, which itself contains a TER domain. 

As our primary interface with the environment, the skin is constantly subjected to injury 

20 and invasion by pathogens. The fundamental force driving the evolution of the immune system 
has been the need to protect the host against overwhelming infection. The ability of T and B cells 
to recombine antigen receptor genes during development provides an efficient, flexible, and 
powerful immune system with nearly unlimited specificity for antigen. The capacity to expand 
subsets of antigen-specific lymphocytes that become activated by environmental antigens 

25 (memory response) is termed "acquired" immunity. Immunologic memory, although a 

fundamental aspect of mammalian biology, is a relatively recent evolutionary event that permits 
organisms to live for years to decades. "Innate" immunity, mediated by genes that remain in 
germ line conformation and encode for proteins that recognize conserved structural patterns on 
microorganisms, is a much more ancient system of host defense. Defensins and other 

30 antimicrobial peptides, complement and opsonins, and endocytic receptors are all considered 
components of the innate immune system. None of these, however, are signal-transducing 
receptors. Most recently, a large family of cell surface receptors that mediate signaling through 
the NF-kappaB transcription factor has been identified. This family of proteins shares striking 
homology with plant and Drosophila genes that mediate innate immunity. In mammals, this 

35 family includes the type I interleukin-1 receptor, the interleukin-1 8 receptor, and a growing 
family of Toll-like receptors, two of which were recently identified as signal-transducing 
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receptors for bacterial endotoxin- In this review, we discuss how interleukin-1 links the innate 
and acquired immune systems to provide synergistic host defense activities in skin. 

In Drosophila the Toll protein is involved in establishment of dorso-ventral polarity in 
the embryo. In addition, members of the Toll family play a key role in innate antibacterial and 
5 antifungal immunity in insects as well as in mammals. These proteins are type-I transmembrane 
receptors that share an intracellular 200 residue domain with the interleukin-1 receptor (IL-1R), 
the Toll/EL-IR homologous region (TIR). The similarity between Toll-like receptors (LTRs) and 
IL-1R is not restricted to sequence homology since these proteins also share a similar signaling 
pathway. They both induce the activation of a Rel type transcription factor via an adaptor protein 

10 and a protein kinase. Interestingly, MyD88, a cytoplasmic adaptor protein found in mammals, 
contains a TIR domain associated to a DEATH domain (see EPR000488 ). Besides the 
mammalian and Drosophila proteins, a TIR domain is also found in a number of plant proteins 
implicated in host defense. As MyD88, these proteins are cytoplasmic. Site directed mutagenesis 
and deletion analysis have shown that the TIR domain is essential for Toll and EL-1R activities. 

1 5 Sequence analysis have revealed the presence of three highly conserved regions among the 
different members of the family: box 1 (FDAFISY), box 2 (GYKLC-RD-PG), and box 3 (a 
conserved W surrounded by basic residues). It has been proposed that boxes 1 and 2 are involved 
in the binding of proteins involved in signaling, whereas box 3 is primarily involved in directing 
localization of receptor, perhaps through interactions with cytoskeletal elements 

20 Toll is a Drosophila gene essential for ontogenesis and antimicrobial resistance. Several 

hortologues of Toll have been identified and cloned in vertebrates, namely Toll-like receptors 
(TLR). Human TLR are a growing family of molecules involved in innate immunity. TLR are 
structurally characterized by a cytoplasmic Toll/interleukin-lR (TIR) domain and by 
extracellular leucine-rich repeats. TLR characterized so far activate the MyD88/IRAK signaling 

25 cascade, which bifurcates and leads to NF-kappaB and c-Jun/ATF2/TCF activation. Genetic, 
gene transfer, and dorninant-negative approaches have involved TLR family members (TLR2 
and TLR4) in lipopolysaccharide recognition and signaling. Accumulating evidence suggests 
that some TLR molecules are also involved in signaling receptor complexes that recognize 
components of gram-positive bacteria and mycobacteria. However, the definitive role of other 

30 TLR is still lacking. A systematic approach has been used to determine whether different human 
leukocyte populations selectively or specifically expressed TLR mRNA. Based on expression 
pattern, TLR can be classified as ubiquitous (TLR1), restricted (TLR2, TLR4, and TLR5), and 
specific (TLR3). Expression and regulation of distinct though overlapping ligand recognition 
patterns may underlie the existence of a numerous, seemingly redundant, TLR family. 
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Alternately, the expression of a TLR in a single cell type may indicate a specific role for this 
molecule in a restricted setting. 

The amino acids differences between the three MOL1 proteins are shown in Table 1H. 
Deletions are marked by a delta (A). The differences between the three proteins appear to be 
5 localized to a few distinct regions. Thus, these proteins may have similar functions, such as 
serving as olfactory or chemokine Teceptors. 
Uses of the Compositions of the Invention 

The above defined information for this invention suggests that this Toll Receptor-like 
protein may function as a member of a 'Toll Receptor family". Therefore, the novel nucleic 

10 acids and proteins identified here may be useful in potential therapeutic applications implicated 
in (but not limited to) various pathologies and disorders as indicated below. The potential 
therapeutic applications for this invention include, but are not limited to: protein therapeutic, 
small molecule drug target, antibody target (therapeutic, diagnostic, drug targetmg/cytotoxic 
antibody), diagnostic and/or prognostic marker, gene therapy (gene delivery/gene ablation), 

15. research tools, tissue regeneration in vivo and in vitro of all tissues and cell types composing (but 
not limited to) those defined here. 

The nucleic acids and proteins of the invention are useful in potential therapeutic 
applications implicated in pancreatic cancer, adenoma, and other cancers, Larsen syndrome, 
tachycardia, erythroderma, night blindness, long QT syndrome, brugada syndrome, heart block, 

20 cell-mediated immunity, and applications as a mediator in inflammation and/or other pathologies 
and disorders. For example, a cDNA encoding the Toll Receptor-like protein may be useful in 
gene therapy, and the Toll Receptor-like protein may be useful when administered to a subject in 
need thereof. By way of nonlimiting example, the compositions of the present invention will 
have efficacy for treatment of patients suffering from pancreatic cancer, adenoma, and other 

25 cancers, Larsen syndrome, tachycardia, erythroderma, night blindness, long QT syndrome, 
brugada syndrome, heart block, cell-mediated immunity, and applications as a mediator in 
inflammation. The novel nucleic acid encoding Toll Receptor-like protein, and the Toll 
Receptor-like protein of the invention, or fragments thereof, may further be useful in diagnostic 
applications, wherein the presence or amount of the nucleic acid or the protein are to be assessed 

30 These materials are further useful in the generation of antibodies that bind immunospecifically to 
the novel substances of the invention for use in therapeutic or diagnostic methods. 
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MOL2 

An additional murine GNC2 elFK -like protein of the invention, referred to herein as 
MOL2, is an Olfactory Receptor ("OR")-like protein. Hie novel nucleic acid of 4989 
nucleotides, (20466828_EXT1, SEQ ID NO:3) encoding a novel GNC2 elFK-like protein is 
5 shown in Table 2 A. 



Table 2A. MOL2 Nucleotide Sequence (SEQ ID NO:3) 

ATGGCTGGGGGCCGTGGGGCCCCCGGGCG03GCCGGGACGAGCCTCCGGAGAGCTACCCGCAACGACAGGACCACGA 
GCTACAGGCCCTGGAGGCCATCTACGGCGCGGACTTCQ^GACCTGCG^CCGGACGCTTGCGGACCGGTTAAGGTCA 
AAGAGCCCCCTGAAATCAATT T AG TTTTGTACCCTCAAGGCCTAACTGGTGAAGAAGT ATATGTAAAAGTGGATTTG 
AGGGTTAAATGCCCACCTACCTATCCAGMGTAGTTCCTGAAATAGAGTTAAAAAATGCCAAAGGTCTATCAAATGA 
AAGTGTCAATT TGTTAAAATC TCGCCTAGAAGAACTGGCCAAGAAACACTGTGGGGAGGTAGTGATGAT CTTTGAAC 
TGGCTTACCACGTGCAGTCOTTTCTCAGCGAGCATAAC^ 

AG GC GG GCT C AAGAGGAG CAACAGAG GCTGTT GGAGG C CCAAGCGGAAAGAAGAGCAC AGCAACGT GAAAT CCT GCA 
TGAGATTCAG AG AAGGAAAG AAGAGAT AAAAGAAGAG AAAAAAAGG AAAGAAATGG CT AAG CAGGAACG T TT GG AAA 
TT GCT AG T T TGTCAAACCAAGAT CAT ACCT CT AAG AAGG AC C CAGGAGGACACAGAAC GGC TGCCAT T C T ACAT GGA 
GGCTCTCCTGACTTTGTAGGAAATGGTAAACATCGGGCAAACTCCTCAGGAAGGTCTAGGTTAAGGCGAGAACGTCA 
GTATTCTGTATGT AATAGT GAAGATT CTCCTGGCTCT TGTGAAATTCTGT ATT T CAAT ATGGGGAGTCCTGATCAGC 
TCATGGTGCACAAAGGSAAATGTATTGGCAGTGATGAACAACTTGG^ 
ACTG^TGGCTTTGrCTTGTTGTATGAGTGGGTCCTTCAGTGGCAGAAAAAAATGGGTC^ 
AAAAG AG AAGATT GATAAGTGCAAAAAGCAGATTCAAGGAACAGAAACAGAAT TC^^ 

ATCCAAAT GT AGT ACGCT ACCTTGCAATG AATTCAAAGAGCAAG ACGACTC CATTC GT GGTGGACATTTTAGT GGAG 

CACATTAGTGGGGTCTCTCTTGCTGCACACCTGAGCCACTCAGGCCCCATCCCTGTGCATCAGCTTCGCAGGTAC^ 

AGCTCAGCTCCTGTCAGGCCTTGATTArCTGCACAGCAATTCTGTGGTGCATAAGGTCCTGAGTGCATCTAATGTCT 

TGGTGGATGCAGAAGGCACCGTCJ^ATTACGGACTATAGCATTTCTAAGCGCCTCGC^ 

CTCTTTGAGCAAACCCGAGTTCGTTTTAGTGACAATGCTCTGCCTTATAAAACGGGGAAGAAAGGAGATGTTT 

TCTTGGCCTTCTGCTGCTGTCCCTCAGCCAAGGACAGGAATGTGGAGACTACCCTGTGACCATCCCTAGTG^^ 

CAGCTGACTTTCAAGATTTTCTAAAGAAGAGATGTGTGTGCTTGGATGACAAGGAAAGATGGAGTC 

TTGAAACACAGCTTTATAAATCCCCAGCCAAAAATGCCTCTAGTGGAACAAAGTCCTGAATCTGAAGGACAAGA 

TGTTGAGACTGTTATTCCTAGCAACCGGCTACCCAGTGCTGCCTTCTTTAGTGAGACACAGAGACAGTTTTCCCGAT 

ACTTCATTGAGTT TGAAGAATT ACAACTTCTTGGTAAAGGAGCT TTT GGAGCT GT CATCAAGGTGCAGAACAAGTTG 

GACGGCTG<:TGCTACGCaGTGAAGCGCATC:3C<M 

GACACTGCTGTCACGGCT GCACCATGAGAACATTGTGCGCT ACTACAACGCCT GGATCGAGCX3GCAC GAGCGGCCGG 
CGGGACCGGGGACGCCGCCCCCGGACTCCGGGCCCCTGGCCAAGGATGACCGAGCTGCACGCGGGCAGCCGGCGAGC 
GACACAGACGGCCTGGACAGCGTAGAGGCCGCCGCGCCGCCACCCATCCTCAGCAGCTCGGTGGAGTGGAGCACTTC 
GGGCGAGCGCTCGGCCAGTGCCCGTTTCCCCGCCACCGGCCCGGGCTCCAGCGATGACGAGGACGACGACGAGGACG 
AGCACGGTGGCGTCTTCTCCCAGTCCTTCCTt^CTGCTTCAGATTCTGAAAGTGATATTATCTTTGACAATGAAGAT 
GAGAACAGT AAAAG TCAG AAT C AG GAT GAAGATT GCAAT G AAAAGAATGGC T GC C AT G AAAGTGAG CC AT CAGT GAC 
GACTGAGGCT GT G C ACTAC CT AT ACAT CC AG ATGGAGTAC T G T GAGAAG AGCAC T T T ACGAGACACCAT T GAC CAGG 
GACTGTATCGAGACACCGTCAGACTCTG<3AGGCTTTTTCGAGAGATTCTGGATGGATTAGCTTATATCCATGAGAAA 
GGAATGATTCACCGGGATTTGAAGCCTGTCAACATTTTTTTGGATTCTGATGACCATGTGAAAATAGGTGATTTTGG 
ITTTGGCGACAGACCATCTAGCCTTTTCTGCTGA^ 

CTTCAGGTCACTT AACTGGGATGGTTGGCACTGCTCT CTATGTAAG CCCAGAGGTCCAAGGAAGC ACCAAAT CT GCA 

TAC^CCAGAAAGTGGATCTCTTCAGCCTGGGAATTATCTTCTTTGAGATGTCCTATCACCCCATGGTCACGGCTTC 

AGAAAG^ATCTTTGTTCTCAACCAACTCAGAGATCCCACTTCGCCTAAGTTTCCAGAAGACTTTGAC 

ATGCAAAGCAGAAATCAGTCATCTCCTGGCTGTTGAACCACGATCCAGCAAAACGGCCCACAGC 

AAGAGTGAGCTGCTGCCCCCACCCC^GATGGAGGAGTCAGAGCTGCATGAAGTGCTGCAC(^CACGCTGACCAACGT 

GGATGGGAAGGCCTACCGCACCATGATGGCCCAGATCTTCTCGCAGCGCATCTCCCCTGCCATCGATTACACCTATG 

ACAGCGAC2AT ACTGAAGGGCAACTT CTCAATCCGTACAGCCAAGAT GC AGCAGCATGT GTGTGAAACCATCATCCGC 

ATCTTTAAAAGACATGGTGCT GTTCAGTTGrcTACTCCACT ACTGCTTCCCCGAAACAGAC AAAT AT AT GAGCACAA 

CGAAGCTGCCCTATTCATGGACCACAGCGGGATGCTGGTGATGCTTCCTTTTGACCTGCGGGTGCCTTTTGCAAGAT 

ATCTGGCAAGAAAT AAT ATATTGAAT TTAAAACGGTACTGCAT AGAACGTGTGTT C^CGCCGCGCAAGT TAGAT CGA 

TTTCATCCCAAAGAACTTCTGGAGTGTGCCTTTGATATTGT^ 

AATTATCTACACT ATCTAT GAAATCATCCAAGAGTTTCC^GCACTTCAAGAAAGAAATTACAGTATTTAT TT GAACC 
ATACCATGTT ATTGAAAGCAAT ACT CT T ACACTGTGGGATCC CAGAAGAT AAAC TCAG T C AAGT C T ACAT T ATT CTG 
TATGATGCTGTGACAGAGAAGCTGACGAGGAGAGAAGTGGAAGCTAAATTTTGTAATCTGTCTTTGTCTTC 
TCTGT GTCGACTCTACAAGTT T ATTGAACAGAAGGGAGATT TGCAAGATCTT ATGCCAACAATAAATT C ATTAATAG 
AACAGAAAACAGG T ATTG CAC AGTT GGT GAAGTAT GG C TT AAAAG ACCT AGAGGAGGT T GT TGG AC T GT TGAAG AAA 
CTCGGCATCAAGTTACAGGTTTG£GTCTTGATC3^ 

CTTCCAGTTTGTGGCTATCATCiU^CGAAGGCAAAGGGCTGTACCTGAAATCCT 
TGCTGATTCCCCAGTTTAGAGGGCCACAAGCTCTGGGGCCAGTTCCCACT 
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AAGATATCTGCTGCTGTCCTCAACATGGAGGAATCTGTAAGTTCTGTTACAATAGGCTCTGGGGACCTCCTGGTTGT 
T^AGTGTGGGCCAGATGTC T AT GTC CAGGGCCAT AAACCTAAC CCAGAAAC TCT GGACAGCAGGCAT CACAG CAGAAA 
TCATGT ACGACTGGTCACAGT T T C^GTCCCAAGAGGAATT ACAAG AGT AC TGC AGACATCAT GAAAT CACC T AT GTG 
GCCCTTGTCTCGGATAAAGAAGGAAGCCATGTCAAGGTTAAGTCTT^ 

GCTGGAGACTGAACTT GT GGACCATGT ACTGC AG AAACTGAGGACTAAAGTCACTG ATGAAAGGAATTT T AGAGAAG 
CTTCCGATMTCTTGCAGTGCAAAAT CTGAAGGGGT CATT TTCTAATGCT TCAGGTTTGTTTGAAATCC ATGGAGCA 
ACAGTGGT TCCCATTGTGAGT GT GCT AGCCCCGGAGAAGCTGTCAGCCAGCACTAGGAGGCGCT AT GAAACTCAGGT 
ACAAACTCGACTTCAGACCTCCCTTGCCAACTTACATCAG 

T AC CCAAAG AAACAAT AT T ACAG T T T TT ATCATT AGAGTGGGATGCT GAT GAACAGGCATT T AACACJ\AC TGTGAAG 
CAGCTGCTGTCACGCCTGCCAAAGCAAAGATACCTC^VAATTAGTCTGTGATGAA^ 

AAAGGTGTCTGTGCI ATTTCT GT ACAGCTATAGAGATGACTACTACAGAATCTTAT TTTAA 



An open reading fame (ORF) for MOL2 was identified from nucleotides 1 to 4986. The 
disclosed MOL2 polypeptide (SEQ ID NO:4) encoded by SEQ ID NO:3 is 1662 amino acid 
residues, has a molecular weight of 188250.1 and is presented using the one-letter code in Table 
5 2B. The SignalP, Psort and or Hydropathy profile of MOL2 indicate that this sequence does not 
have a signal peptide and is likely to be localized to the nucleus. Therefore it is likely that 
MOL2 is available at the appropriate sub-cellular localization and hence accessible for the 
therapeutic uses described in this application. 



Table 2B. Encoded MOL2 protein sequence (SEQ ID NO:4). 

MAGGRGAPGRGRDEPPESYPQRQDEELQALEAIYGADFQDL^ 

RVKCPPTYPDWPEIELKNAKGLSNESVNLLKSRLEELAKKH^ 

RRAQEEQQRLLEAQAERRAQQREILHEIQRRKEEIKEEKKR 

GSPDFVGNGKHRANSSGRSRLRRERQYSVCNSEDSPGSCEILY 

TGGEVLLYEWLQWQKKMGPFLTSQEKEKIDKC 

HISGVSLAAHLSHSGPIPVHQLRRYTAQLLSGLDYLHSNSV 

VFEQTRVRFSDNALPYKTGKKGDWRLGLLLLSLSQGQECC-EYPVTIPSDLPADFQDFLKKRCVCLDDKERWSPQQL 
LKHSFINPQPKMPLVEQSPESEG^DYVETVIPSNRLPSAAFFSETQRQFSRYFIEFEELQLLGKGAFGAVIKVQNKL 
DGCCYAVKRIPINPASRQFRRIKGEVTLLSRLHHEJJIVRYYNA^ 

DTDGLDSVEAAAPPPILSSSVEWSTSGERSASARFPATGPGSSDDEDDDEDEHGGVFSQSFLPASDSESDIIFDNED 

ENSKSQNQDErcNEKNGCHESEPSVTTEAVHYLYIQMEYCE^ 

GMIHRDLKPWIFLDSDDHVKIGDFGLATDHLAFSADSKQDD^ 

YNQKVDLFSLGI I FFEMS YHPMVTASERI FVLNQLRDPTS PKFPEDFDDGEHAKQKS VISWLLNHDPAKRPTATELL 
KSELLPPPQMEE SELHE VLHHTLTNVDGBCAYRTMMAQI FSQRIS PAID YT YDS DILKGNFS IRTAKMQQHVCET I IR 
IFKRHGAVQI^TPLLLPRNRQIYEHNEAALFMDHSGML^LPFDLRVPFARYVARNNILNLKRYCIERVFTPRKLDR 
FHPKELLECAFDIVTSTTNSFLPTAE 1 1 YT I YE I IQEFPALQERNYS I YLNHTMLLKAILLHCGI PEDKLSQVYIIL 
YDAVTEKLTRREVEAKFCNLSLSSNSLCRLYKFIEQKGEJI^DI^PTINSLIEQKTGIAQLVKYGLKDLEEWGLLK^ 
LGIKlQVWLINLGL\n(KVQQHNGIIFQFVAIIKRRQRAVPEIL^ 

K I SAAVLNMEES VS S VT I GS GDLLWS VGQMSMSRAINLTQKLWT AG IT AE IMYDWS QFQS QS ELQE YCRHHE I T YV 
ALVS DKEGSHVKVKS FEKERQTEKRVLETELVDHVLQKLRTKVT DERN FREASDNLAVQNLKGSFSNAS GLFE I HGA 
TWPIVSVLAPEKLSASTPnPJ*YETQVQTRLQTSLANLHQK^ 

QLL S RLPKQR YLKL VC DE I YNI KVE KKVS VIiFL Y5 YRDDYYR IL F 



10 The MOL2 nucleic acid sequence has 31 19 of 3723 bases (83 %) identical to aMus 

musculus GCN2 EIF2alpha kinase mRNA (GENBANK-ID: MMU243533|acc:AJ243533 ) 

The full amino acid sequence of the protein of the invention was found to have 479 of 
1662 bases (88 %) amino acid residues (88 %) identical to, and 1 554 of 1662 residues (93 %) 
similar to, the 1648 amino acid residue CAB58363 GCN2 EIF2alpha kinase protein from Mus 
15 musculus (ptnr: TREMBLNE W- ACC : CAB58363 ). 

Other BLAST results including the sequences used for ClustaTW analysis are presented in 
Table 2C. 
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Table 2C. BLAST results for MOL2 


Gene Index/ 
Identifier 


Protein/ 
Organism 


Length 
(aa) 


Identity 
(%) 


Positives 
(%) 


Expect 


PatP:B65663 


Pro-ein 
kinase [Homo 
sapiens] 


1649 


1626/1662 
(97%) 


1632/1662 
(98%) 


0.0 


PatP:B43581 


Cancer 
associated 
protein [Homo 
sapiens] 


604 


592/609 
(97%) 


594/609 
(97%) 


6.5e- 
306 


r> *- o . n/m <T 1 

rdtr .OH£ fOl 


UK" i jZj 

polypeptide 




DO D/ Di4 

(93%) 


030/ Oil ft 

(95%) 


-L • JC 

300 


gi|10764165|gb|AAG2 
2591.11 (AF193344) 


GCN2garama 
[Mus 

mus cuius] 


1570 


1313/1577 
(83%) 


1375/1577 
(86%) 


0.0 


gi|11360320|pir| |T4 
6924 


probable 
translation 
iniLidLion 
factor elF- 
2 alpha kinase 

\ CjO £ • 1 - -L - ) 

[similarity] 

_ hitman 

(fragment) 


938 


887/946 
(93%) 


889/946 
(93%) 


0.0 


gi 1 7305017 |ref|NP 0 
38747.11 


GCN2 

eIF2alpha 
kinase [Mus 
mus cuius] 


1648 


1374/1647 
(83%) 


1442/1647 
(87%) 


0.0 


gi 1 10764161 1 gb| AAG2 
2589.11 


GCN2 alpha 
[Mus 

mus cuius] 


1370 


1189/1381 
(86%) 


1241/1381 
(89%) 


0.0 


gi 1 7243057 Idbj IBAA9 
2576.11 (AB037759) 


KIAA1338 
protein [Homo 
sapiens] 


1495 


1377/1460 
(94%) 


1380/1460 
(94%) 


0.0 


gi|107641631gb|AAG2 
2590.11 (AF193343) 


GCN2beta [Mus 
musculus ] 


1648 


1373/1647 
(83%) 


1440/1647 
(87%) 


0.0 


gi | 6065914 I emb | CABS 
8360.11 (AJ243428) 


putative eIF2 
alpha kinase 
[Homo 
sapiens] 


548 


505/556 
(90%) 


507/556 
(90%) 


0.0 



This information is presented graphically in the multiple sequence alignment given in 
5 Table 2D (with MOL2 being shown on line 1) as a ClustalW analysis comparing MOL2 with 
related protein sequences. 



Table 2D. Information for the ClustalW proteins: 

1) Novel M0L2 (SEQ ID NO: 4) 
10 2) gi|10764165|gb|AAG22591.1| (AF193344) GCN2gamraa [Mus musculus] (SEQ ID 

NO: 35) 

3) gi|11360320|pir| IT'46924 probable translation initiation factor elF- , 
2alpha kinase (EC 2.7.1.-) [similarity] - human (fragment) (SEQ ID NO:36) 

4) gi 17305017 | ref | NP_038747 . 1 1 GCN2 eIF2alpha kinase [Mus musculus] (SEQ 
15 ID NO: 37) 

5) gi I 7243057 Idbj | BAA92576. 1| (AB037759) KIAA1338 protein (Homo sapiens] 
(SEQ ID NO: 38) 

6) gi|10764163|gb|AAG22590.11 (AF193343) GCN2beta [Mus musculus] (SEQ ID 
NO: 39) 
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7) gi| 6065914 | emb | CAB58360.il (AJ243428) putative eIF2 alpha kinase [Homo 
sapiens] (SEQ ID NO: 40) 



MDL2 Prote 

AF193344 

T46924 

NP 038747. 

AAG22589.1 

AB037759 

AF193343 

AJ243428 



10 20 30 40 50 60 

. . . . I I ....I. ...I ....|. ...| ....|. ...| ....|.. ..| ....|....| 

MAGGRGAPGR GRDEPPESYP QRQDHELQAL EAIYGADFQD LRPDACGPVK VREPPEINLV 
MRTQRAL 



MAGGRGASGR GRAEPQESYS QRQDHELQAL EAIYGSDFQD LRPDARG — R VREPPEINLV 



LLL 

MAGGRGASGR GRAEPQESYS QRQDHELQAL EAIYGSDFQD LRPDARG — R VREPPEINLV 



60 

7 

1 

58 

1 

3 

53 
1 



70 80 90 100 110 120 

| ....|....| ....|....| ,...|.... | ,...|....| 

MOL2 Prote LYPQGLTGEE VYVKVDLRVK CPPTYPDWP EIELKNAKGL SNHSVNfflK0 RLEELAKKHC 120 

API 9334 4 LVP EIELKNAKGL SNE^SVnHk@ HLEELAKKQC 40 

T46924 1 

NP_038747. LYPQGLAGEE VYVQVELQVK CPPTYPDWP EIELKNAKGL SNE3,VN2jKg HLEELAKKQC 118 

AF193343 LYPQGLAGEE VYVQVELQVK CPPTYPDWP EIELKNAKGL SNESVN^k| HLEELAKKQC 118 



130 140 ISO 160 170 180 

....I.... I ....|.... | ....|.... | I .... I t .... I ....I.... I 

M0L2 Prote GEWMIFELA YHVQSFLSEH NK^PKSFHE EMLERRAQEE QQRLLEAQAE RRAQQREILH 180 

AF193344 G-EVMIFELA HHVQSFLSEH NK^PKSFHE EMLERQAQEK QQRLLEARRK EEQEQREIl| 99 

T46924 1 

NPJ338747. G-EVMIFELA HHVQSFLSEH NK@§PKSFHE EMLERQAQEK QQRLLEARRK EEQEQREILj] 177 

AAG22589.1 1 

AB037759 ^ FOS 23 

AF1 93343 G-EVMIFELA HHVQSFLSEH NK^PKSFHE EMLERQAQEK QQRLLEARRK EEQEQREILjJJ 177 



MOL2 Prote 

AF193344 

T46924 

NP 038747. 

AAG22589.1 

AB037759 

AF193343 

AJ243428 



190 



200 



210 



220 



230 



240 



EIQRRKEEIKlSEKKRKEMAt 

riqrrkeeikIsskkrkehaf 




MOL2 Prote 

AF193344 

T46924 

NP_038747. 

AAG22589.1 

AB037759 

AF193343 

AJ243428 



250 



260 



270 



280 



290 



300 




300 
217 
1 

295 

17 

141 

295 

1 



MOL2 Prote 
AF193344 
T46924 
NP_038747 . 
AAG225B9 . 1 
AB037759 
API 93343 
AJ243428 




M0L2 Prote 

AF193344 

T46924 

NP_038747 . 

AAG22589.1 

AB037759 

AF193343 



370 



380 



390 



400 



410 




SLMjHLSHSG 




420 
I I 

420 
336 

414 
136 
261 
414 



TAQLLjjGLDY 

taqilSgldy 
taqliSgldy 
taqll2gldy 
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AJ243428 



M0L2 Prote 

&F193344 

T46924 

NP 03B747. 

AAG22589 . 1 

AB037759 

AF193343 

AJ243428 



MOL2 Prote 

AF193344 

T46924 

NP 038747. 

AAG22589.1 

AB037759 

AP193343 

AJ243428 



MDL2 Prote 

AF193344 

T46924 

NPJJ38747. 

AAG22589 . 1 

AB037759 

AF193343 

AJ243428 



M0L2 Prote 

AF193344 

T46924 

NP_038747. 

AAG22589.1 

AB0377S9 

AFX93343 

AJ243428 



M0L2 Prote 

AF193344 

T46924 

NPJ)38747. 

ARG225B9.1 

AB037759 

AF193343 

AJ243428 



MDL2 Prote 

AF193344 

T46924 

NP 038747. 

A&G22589.1 

AB037759 

AP193343 

AJ243428 



MOL2 Prote 

AF193344 

T46924 

NP_038747. 

A&622589.1 

AB037759 

AP193343 

AJ243428 



430 
..I • 



440 
..I . 



450 
..I . 



460 
..I 



470 
..I . 



— 1 



480 
.J 



lksSsvvhkvIlsassvlvdaIegtvkitdys 
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850 

.-I . 



. I . 



860 
.-1 . 



.1. 



870 
..I - 



680 



MDL2 Prote 

AF193344 

T46924 

NP_038747. 

&&G22589.1 

AB037759 

AF193343 

AJ243428 



MDL2 Prote 

AP193344 

T46924 

NPJ53B747. 

A&G22589 . 1 

AB037759 

AF193343 

AJ243428 



MOL2 Prote 

AF193344 

T46924 

NP_038747. 

AAG22589.1 

AB037759 

AF193343 

AJ243428 



M0L2 Pro to 

AF193344 

T46924 

NPJJ38747. 

AAG22589 . 1 

AB037759 

AF193343 

AJ243428 



MDL2 Prote 

AF193344 

T46924 

NPJ>38747. 

AAG22589 . 1 

AB037759 

AF193343 

AJ243428 



MDL2 Prote 

AF193344 

T46924 

NP_038747. 

AAG22589 . 1 

AB037759 

AF193343 

AJ243428 



M0L2 Prote 

RF193344 

T46924 

NP 038747. 

AAG22589.1 

AB037759 

AF193343 

AJ243428 



GLAl'IHKKGM 
GLAYIHEKGM 
GLAYIHEKGt-5 
GLAYIHEKGH 
GLAYIHEKGM 
GLAYIKEKGM 
GLAYIHEKGE-' 



IHKDLKPVgtt 

IHRDLKPvffll 
IHHDLKPvSl 
IHRDLKPVgjl 
IHRDLKPvSl 
IHRDLKPvSl 



FLDSDDHVKI 
FLDSDDHVKI 
FLDSDDHVKI 
FLDSDDHVKI 
FLDSDDHVKI 
FLDSDDHVKI 
FLDSDDHVKI 



GDFGLATDH1 
GDFGLATDHL 
GDFGLATDHL 
GDFGLATDKL 
GDFGLATDKI 
GDFGLATDHL 
GDFGLATDHL 



890 900 
!...,| 1 I 

BtS3-L 
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.1 . 



1270 
..I 



MOL2 Prote 

AF193344 

T46924 

NP_038747. 

AAG22589.1 

AB037759 

ATI 93343 

AJ243420 



lcrlykfieq 
lcrlykfieq 
lcrlykfieq 
lcrlykfieq 
lcrlykfieq 
lcrlykfieq 
lcrlykfieq 

LCRLYKFIEQ 




1280 
..I .. 



.1. 



1290 
..I 



1300 
.1 



.1. 



?T IlT:ST.!£2XTG 
PT iBsLIKQKTG 
PT IijSLIKQKTG 1 
PT IgSLIKQKTG 
?T IgSLIKQKTG 
?T IBSLIKQKTG 
?T iBsLIKQKTG 



?T| IBSLIKQKTG 



MOL2 prote 








AF193344 




E 




T46924 




K 


"/}■ 


NP 038747. 


S 




.". - 


AAG225B9.1 


s 




*i 


AB037759 


L 




r- 


AF193343 


S 






AJ243428 









1330 



1340 

-.1 .. 



1350 




QQHBpiIFQF 

qqhSgiifqf 

QQHgGIIFQF 

qqhtgiifqf 
qqhSgiifqf 
qqkSgiifqf 
qqkSgiifqf 
qqnEgiifqf 




DLF.F.WG 
DLEEWG 
DLEEWG 

dldJvvg 

DLEEWG 
DLEEWG 
DLEEWG 
DLEEWG 



.1. 




YDLLIPSfc'RG 
YDLLIPgFRG 

ydllipSfrg 
ydllip&frg 

YDLLIPjjSFRG 
vDLLIPgtRG 

ydllipSfrg 
ydllipsfrg 



pqalgpvpta 
pqalgpvpta 
pqalgpvpta 
pqJJBgpvptp. 
pqalgpvpta 
pqalgpvpta 
pqalgpvpta 
pqalgpvpta 



1378 

1292 

660 

1370 

1092 

1217 

1370 

270 



.1. 



MDL2 Prote 

AF193344 

746924 

NP_03B747. 

AAG22589 . 1 

AB037759 

AF193343 

AJ243428 



C-VSIAIDKI 
GVSIAIDKI 
GVSIAIDKI 
GVSIAIDKI 
GVSIAIDKI 
GVSIAIDKI 
GVSIAIDKI 
GVSIAIDKI 



1390 1400 1410 1420 

I ....I. ...| ...,!....) . 

- - - "GiGa - 





. I. 



1430 
..I 



1440 
..I 



LLVVSVGQMS 
LLWSjjGQKS 
LLVVSVGQMS 
LLVVSVGQMS 
LLVVSVGQMS 
LLVVSVGQMS 
LLVVSVGQMS 
LLVVSVGQMS 



MSRAZHLTQK 
MSRAigLTQK 

msraiHltqk 
msraiSltqk 
msraiHltqk 
wsraiHltqk 
msraiHltqk 
msraiHltqk 



LWTAGITAEI 
LWTAGITAEI 
LWTAGITAEI 
LWTAGITAEI 
LWTAGITAEI 
LWTAGITAEI 
LWTAGITAEI 
LWTAGITAEI 



1450 1460 
.1 ....I. ...I . 



1480 



SELQEYCRKIJ 
23LQEYCREK 
E3LQEYCRKK 
EELQEYCRHK 
EELQEYCRKK 
3ELQEYCRKH 
EELQEYCRKK 
EELQEYCRKK 



EITYVALVSD 
EITYVALVSD 
EITYVALVSD 
EITYVALVSD 
EITYVALVSD 
EITYVALVSD 
EITYVALVSD 
EITYVALVSD 




1490 
• •I . 



FEKERQTEKR 
FEKERQTEKR 
FEKERQTEKR 
FEKERQTEKR 
FEKERQTEKR 
FEKERQTEKR 
FEKERQTEKR 
FEKERQTEKR 



MOL2 Prote 

AF193344 

T46924 

MP_038747. 

AAG22589 . 1 

AB037759 

AF193343 

AJ243428 



MDL2 Prote 
AF193344 
T46924 
NPJ)3B747. 
AAG22569 . 1 
AB037759 
AF1 93343 
AJ243428 



MOL2 Prote 

AF193344 

T46924 

NPJJ38747. 

AAG22569 . 1 

AB037759 

AF193343 

AJ243428 



RRRHEfiQVQT 
P.RRSEflQVQr 

rrrReWqvqt 
rreSemqvqt 



RRRHEfiQVQT 
P.RRSEflQVQr 

rrrReWqvqt 

RRF.gEMQVQT 
RRRgEgQVQT 

rrrReaqvqt 
rrrSemqvqt 
rrfReSqvqt 



. I . 



LLSRLPKQRY 
LLSRLPKQRY 
LLSRLPKQRY 
LLSRLPKQRY 
LLSRLPKQRY 
LLSRLPKQRY 
LLSRLPKQRY 
LLSRLPKQRY 




qksseieila 1 
qksseieilaI 

QKSSEIEILAj 
QKSSEIEILA! 
QKSSEIEILA 
QKSSEIEILA 
QKSSEI3ILA 
QKSSEIEILA 



• I- 



IKVEKKVSVL 
IKVEKKVSVL 
IKVEKKVSVL 
IKVEKKVSVL 
IKVEKKVSVL 
IKVEKKVSVL 
IKVEKKVSVL 
IKVEKKVSVL 





qflslewdad 
qflslewdad; 
qflslewdad: 

QFLSLEWDAD 
QFLSLEWDAD 
QFLSLEWDAD 
QFLSLEWDAD 
QFLSLEWDAD 
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Chromosomal information 

MOL2 belongs to genomic DNA [Acc.NO.: AC0251 68 from GenbankNEW]. Within this 
GenbankNew entry was a note showing that the sequence was from Chromosome 15ql4. 
5 Therefore we assign the chromosomal locus of this invention as Chromosome 15ql4. 



Tissue expression 

MOL2 is expressed in at least the following tissues: brain and liver (derived from 
literature sources) and thyroid (derived from 20466828_EXT1). 

10 Based on information available on expression of SWISSPROT-ACC:P29089 TYPE-IB 

ANGIOTENSIN II RECEPTOR (AT1B) (AT3) - Rattus norvegicus (Rat), the closest G-protein 
coupled receptor family member it is likely that MOL2 is expressed in cardiac tissue, renal 
tissue, and vascular tissue as angiotensin is expressed in these tissues.MOL2 has similarity to the 
murine GNC2 elFK protein, a possible GNC2 elFK and other GNC2 elFKs and their functions 

15 as described in but not limited to the references below: In eukaryotic cells, protein synthesis is 
regulated in response to various environmental stresses by phosphorylating the alpha subunit of 
the eukaryotic initiation factor 2 (eIF2alpha) (Berlanga, etal., Eur J Biochem; 265(2):754-62; 
Oct,1999). Three different eIF2alpha kinases have been identified in mammalian cells, the heme- 
regulated inhibitor (HRI), the interferon-inducible RNA-dependent kinase (PKR) and the 

20 endoplasmic reticulum-resident kinase (PERK). A fourth eIF2alpha kinase, termed GCN2, was 
previously characterized from Saccharomyces cerevisiae, Drosophila melanogaster and 
Neurospora crassa. Berlanga et al. 1999 describe the cloning of a mouse GCN2 cDNA 
(MGCN2), which represents the first mammalian GCN2 homolog. MGCN2 has a conserved 
motif, N-terminal to the kinase subdomain V, and a large insert of 139 amino acids located 

25 between subdomains IV and V that are characteristic of the known eIF2alpha kinases. 

Furthermore, MGCN2 contains a class II aminoacyl-tRNA synthetase domain and a degenerate 
kinase segment, downstream and upstream of the eIF2alpha kinase domain, respectively, and 
both are singular features of GCN2 protein kinases. MGCN2 mRNA is expressed as a single 
message of approximately 5.5 kb in a wide range of different tissues, with the highest levels in 

30 the liver and the brain. Specific polyclonal anti-(MGCN2) immunoprecipitated an eIF2alpha 

kinase activity and recognized a 190 kDa phosphoprotein in Western blots from either mouse 

liver or MGCN2-transfected 293 cell extracts. Interestingly, serum starvation increased 

eIF2alpha phosphorylation in MGCN2-transfected human 293T cells. This finding provides 

evidence that GCN2 is the unique eIF2alpha kinase present in all eukaryotes from yeast to 
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mammals and underscores the role of MGCN2 kinase in translational control and its potential 
physiological significance. 

A family of protein kinases regulates translation in response to different cellular stresses 
byphosphorylation of the alpha subunit of eukaryotic initiation factor-2 (eIF-2alpha). In yeast, an 
5 eIF-2alpha kinase, GCN2, functions in translational control in response to amino acid starvation. 
It is thought that uncharged tRNA that accumulates during amino acid limitation binds to 
sequences in GCN2 homologous to histidyl-tRNA synthetase (HisRS) enzymes, leading to 
enhanced kinase catalytic activity. Given that starvation for amino acids also stimulates 
phosphorylation of eIF-2alpha in mammalian cells, we searched for and identified a GCN2 

10 homologue in mice. Sood et.al., 2000 cloned three different cDNAs encoding mouse GCN2 
isoforms, derived from a single gene, that vary in their ammo-terminal sequences. Like their 
yeast counterpart, the mouse GCN2 isoforms contain HisRS-related sequences juxtaposed to the 
kinase catalytic domain. While GCN2 rnRNA was found in all mouse tissues examined, the 
isoforms appear to be differentially expressed. Mouse GCN2 expressed in yeast was found to 

1 5 inhibit growth by hyperphosphorylation of eIF-2alpha, requiring both the kinase catalytic 

domain and the HisRS-related sequences. Additionally, lysates prepared from yeast expressing 
mGCN2 were found to phosphorylate recombinant eIF-2alpha substrate. Mouse GCN2 activity 
in both the in vivo and in vitro assays required the presence of serine-51, the known regulatory 
phosphorylation site in eIF-2alpha. Together, those studies identify a new mammalian elF- 

20 2alpha kinase, GCN2, that can mediate translational controLPhosphorylation of the alpha subunit 
of eukaryotic initiation factor 2 (eIF-2alpha) is a well-characterized mechanism regulating , 
protein synthesis in response to environmental stresses (Yang et al., Mol Cell Biol; 20(8):2706- 
17; Apr, 2000) . In the yeast Saccharomyces cerevisiae, starvation for amino acids induces 
phosphorylation of eIF-2alpha by Gcn2 protein kinase, leading to elevated translation of GCN4, 

25 a transcriptional activator of more than 50 genes. Uncharged tRNA that accumulates during 
amino acid limitation is proposed to activate Gcn2p by associating with Gcn2p sequences 
homologous to histidyl-tRNA synthetase (HisRS) enzymes. Given that eIF-2alpha 
phosphorylation in mammals is induced in response to both carbohydrate and amino acid 
limitations, we addressed whether activation of Gcn2p in yeast is also controlled by different 

30 nutrient deprivations. It was found that starvation for glucose induces Gcn2p phosphorylation of 
eIF-2alpha and stimulates GCN4 translation. Induction of eIF-2alpha phosphorylation by Gcn2p 
during glucose limitation requires the function of the HisRS-related domain but is largely 
independent of the ribosome binding sequences of Gcn2p. Furthermore, Gcn20p, a factor 
required for Gcn2 protein kinase stimulation of GCN4 expression in response to amino acid 
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starvation, is not essential for GCN4 translational control in response to limitation for 
carbohydrates. These results indicate there are differences between the mechanisms regulating 
Gcn2p activity in response to amino acid and carbohydrate deficiency. Gcn2p induction of 
GCN4 translation during carbohydrate limitation enhances storage of amino acids in the 
5 vacuoles and facilitates entry into exponential growth during a shift from low-glucose to high- 
glucose medium. Gcn2p function also contributes to maintenance of glycogen levels during 
prolonged glucose starvation, suggesting a linkage between amino acid control and glycogen 
metabolism. 

1 0 Uses of the Compositions of the Invention 

Hie expression pattern, map location and protein similarity information for MOL2 

suggest that it may function like a member of the GCN2 elFocK family. Therefore, the nucleic 
acids and proteins of the invention are useful in potential therapeutic applications implicated, for 
example but not limited to, in various pathologies /disorders as described below: 



15 



Hyperthyroidism 
Hypothyroidism 

Von Hippel-Lindau (VHL) syndrome 
Alzheimer's disease 



20 



Stroke 



Tuberous sclerosis 



30 



25 



Hypercalcemia 
Parkinson's disease 
Huntington's disease 
Cerebral palsy 
Epilepsy 

Lesch-Nyhan syndrome 
Multiple sclerosis 
Ataxia-telangiectasia 
Leukodystrophies 



Behavioral disorders 



Addiction 



Anxiety 



Pain 
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• Cirrhosis 

• Transplantation 

• and/or other pathologies/disorders. 

5 Potential therapeutic uses for the invention(s) are, for example but not limited to, 

the following: (i) Protein therapeutic, (ii) small molecule drug target, (iii) antibody target 
(therapeutic, diagnostic, drug targeting/cytotoxic antibody), (iv) diagnostic and/or prognostic 
marker, (v) gene therapy (gene delivery/gene ablation), (vi) research tools, and (vii) tissue 
regeneration in vitro and in vivo (regeneration for all these tissues and cell types composing these 

10 tissues and cell types derived from these tissues). 

The nucleic acids and proteins of the invention are useful in potential therapeutic 
applications implicated in various diseases and disorders described below and/or other 
pathologies and disorders. For example, but not limited to, a cDNA encoding the GCN2 eBFaK 
-like protein may be useful in gene therapy, and the GCN2 elFaK -like protein may be useful 

1 5 when administered to a subject in need thereof. By way of noruimiting example, the 

compositions of the present invention will have efficacy for treatment of patients suffering from 
Hyperthyroidism, Hypothyroidism, Von Hippel-Lindau (VHL) syndrome, Alzheimer's disease, 
Stroke, Tuberous sclerosis, Hypercalcemia, Parkinson's disease, Huntington's disease, Cerebral 
palsy, Epilepsy, Lesch-Nyhan syndrome, Multiple sclerosis, Ataxia-telangiectasia, 

20 Leukodystrophies, Behavioral disorders, Addiction, Anxiety, Pain, Neuroprotection Cirrhosis, 
Transplantation and/or other pathologies/disorders. The novel nucleic acid encoding the GCN2 
elFaK-like protein, and the GCN2 elFaK -like protein of the invention, or fragments thereof, 
may further be useful in diagnostic applications, wherein the presence or amount of the nucleic 
acid or the protein are to be assessed. These materials are further useful in the generation of 

25 antibodies that bind immunospecifically to the novel substances of the invention for use in 
therapeutic or diagnostic methods. 

MOL3 

An additional protein of the invention, referred to herein as MOL3, is a human 
30 complement C3-like protein. The novel nucleic acid was identified by TblastN using CuraGen 
Corporation's sequence file for MOL probe or homolog, run against the Genomic Dairy Files 
made available by GenBank. The nucleic acid was further predicted by the program GenScan™, 
including selection of exons. These were further modified by means of similarities using 
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BLAST searches. The sequences were then manually corrected for apparent inconsistencies, 
thereby obtaining the sequences encoding the full-length protein. The novel nucleic acid of 4894 
nucleotides (82254077.0.1, SEQ ID NO:5) encoding a novel olfactory receptor-like protein is 
shown in Table 3 A. An open reading frame (ORF) was identified beginning with an ATG 
5 initiation codon at nucleotides 1-3 and ending with a TGA codon at nucleotides 4837-4839. 
Putative untranslated regions downstream from the termination codon are underlined in Table 
3A, and the start and stop codons are in bold letters. 



Table 3A. MOL3 Nucleotide Sequence (SEQ H> NO:5) 

ATCTCCCCCTCTCTCCCTTTCTCTGCCAT^ 

TACATCCTCGTGACCCCCCGAGTTCTGAGGCTTGGCAGTCCGGAGAGCA^ 
AGACAGCCCCTCACAAGGACCCTCAAGGTGAACCT 

AGGAGCCAGCTCATTCTCTCACCAGGAAACAACTTTATGGACCAGGCACCTGTGACGGTTCCCGAGAGCCTGATG 
TACCTCCCAAAACCAGGGCAGCAATATGTCATCATCCGGGCAACTTGGGCACCCACCTCGGGCTCCTCATTCATG 
GAGAAGATGGTGCTGGTGGCTCTTCATGCTGGCTACATCTTTATCCAGACGGAGAAGACCATCTACACCCCTTCT 
CCCCTAGrTCACTACCGGGTGTTCACTGTGAACCAC7\AGATGGACCCTGTGACCAGGACATTCACTCTGGACATC 
AAGAATCCTGATGGGTCCCCAGCTTCCAGAGTCCTTGT 

T GGGGCCCTGAAAAGT CCCT GTGCCT CAGTTT GGGGACCTGGACC ATCGAAG CCAGCT ACCAAAGTACACCCAAG 
CAGAAGTTCGAGGCTGCCTTTGATGTGAAGGAATATGTCCTCCCATCTTTTGAGGTCCAGCTGGTCCCAAATAAG 
ACTTTCTTTTACCTCAAGGATGAGGCTCTGGGCGTTGACATCCAGGCTCGGTATATATTTAACAAGCCAGTGGAC 
GGACATGCTTTGGTCATCTTTGGGGTGAZ^ATTGGACTCCTGCCGGATCCCTATCCAAAGCTCCCTGCASAGGGTG 
GAG G T GACT GAAACAGGGGGT GAGAT GGT G C AAG CT GAGAC C TCAGGGGT GAAGATCAT C C AGAG C CC ATACAAC 
ATCAAGrrCACCAGGACACCCCAGTATTTCAAGCCAGGAATGCCCTTCCACTTTCGGGTCAGAGTCGTACAAAGC 
AG TCCT ATTCAGATCATATTCCAGTCTCACCT CT CAC ACCAGGCC ACTGCAGGCTTTT CCTTCACCTTACCCCAG 
ATTCCACCTCAGGTCTTCATCTQVAATCCTGATGGGTCCCCAGCTTCCAGAGTCCTTGTCCACTCCCAAGACCAA 
AAAGTGTACACCTCAGCTGAGGGGTTGGCCACTCTGACCATCAACACAGATGCAAATCTGGACAAGCTCCCCATC 
GAGGT GAAAACTGAGGAATCTCTTCAGCC AGAGGAGCAGGCT TCAGCC AAGAT GACAG CTTG GC CTTACTTGACT 
CAGGATGGGTCAGGAAACTTCCTACACAT CGAAGTAAAGACATTG GGC AC AGAGGTTGGCAGCAGCATCCAGCTG 
AGCCTCAACACAAGGCATCAGGACCCTAAAACCAAGGACAAGATTACTCACTTCACCATCCTGGTGGTCAGGGAG 
GGTAAGGCCCGGCAGCTTGGGAGGCAGGTGGCGCAGGTGGGGGTACCCTCCTTCCGCATTCTGGCCTTTTATTTA 
CTTCCCAGAGGAGCAAGCCAAGACCCTGAGTT GGTGGCTGATTCC AT ATGGAT TGATGTGAATGAC AGATGCAT A 
GGGCTGAAAGTTGGCTTGAAGAATGATAGATTCTTCCAGTCTTTGGAGCCCAACAGCCAAGTCGAACTGAAGGTG 
ACAGGTGATGCAGAAGCCACAGTGGGGCT GGT GG CTGTGGACAAG GCT GTCTATGTCTTGAACAGC AAACACAAG 
CTCACTCAGAAGAAGGTATGGAATGTGGTGGAGGAACATGACATTGGCTGCACAGGAGGAAGTGGGAAAGACAGA 
T TTGCTGTGTTCAAGGATG CTGGATTGGACCT GAAAATCAGCACAGGAATGGATAGCGGCCACC AG CAAAGTCAC 
AGCTGCCAGG AGGCTGAGGT GGGAG AATC ACT TGAACCCGGGAGGCAGAG GT TGCGGTCAGCCGAGATT GCGCCA 
CTGCACTCCAGCGTGAACAAGTTTAAGACAGAGCTGGAGCAAAAGTGCTGTGAGGCTGGGCTCCGGGAGAGCCCA 
GTGGGGCTGTCTTGTGAGGAGAGGACCTGGCATGTCCGCCATGGTCCAGCCTGTGTGGCTGCTTTCCTGGACTGC 
TGCTC ACACCTGCTCCCT CCAGCGGAT GAAGAAGAGGACTTCGATG ACCT CTTCTTGGATG ACATGCCTGTGCGG 
ACCTTGTTCCCCGAGAGT TGGCTCTGGAACAG C ATCT CCC ATTACCCC AT CTCT GTGAAGGT GCCAGAT TCCATC 
ACCACGTGGCAGTTTGTGGTGGTCAGCCTCAAG<;CTGGACAAGGTGGTCTCTGTGTCTCGGACCCCTTTGAGCTG 
ACAGTTATGAAATCGTTCTTTGTGGACCTTAAGTTGCCCTCCTCCGTGATCAGGAATGAGCAGGTCCAGATCCAA 
GCCATGTTGTACAATTTCAGGGATCGCCAGGCCAAGGTCCGAGTGGAGTTCCCCCACAAGGAGACACTGTGCIAGT 
GCGTCAAAGCCAGGAGCACCATCCCACCAGGTAGTGGTCGTGCX2CCCCACCTCCTCCAAGATAGTACACTTTGTG 
CTT CT CCCTCTGGAGACAGGCAAAGTGGACGTGGAGGTCAAGGCT GTGGGCTACGGGGTCCAGGACCAT GTGAAG 
AAGACACTCTTGGTCGAAGGTTGTGGTTATTCAGGTCAGACCCAGACAAAACTGGTGCCAAGACAGGAGTTCTTG 
AACAT GGT ACCCGACACGGAGGCGGAAGT GTTT ATC AGTGTTCAAGGTGACATCCTTGGTGAGACAATTGTGGGC 
AGCCTGACACCCAGTGAGATTCAGCAGCTGCTGCGGGTCCCCACGGGCTGCCCTGAGCAGACGCTGAGCTCCCTG 
ACGCCCGTCATOVTCCTGTCCCGCTATTTGGATACCACCGGCCAGTGGGGCAAGGTCGGGGTGGAGCACAGGGAC 
C^GGTGATGAAGAATATTGGCTACACTCAGATGCTGACCCACCXSGAGTTCAG^ 

GGGAACCCAGGAAGCACTTGGCTCACAAGCTATGTGTTCCGCGTCTTTGCCCTGGCCTACTCTATGATGACGACC 

CAAGTGCT TAGCCTGTCCTCT CTCTGTGACAT GGCCAACTGGATCAT CATCG ACAGGCAGG CAGAGGATGGGCAC 

TTCCTGGAGAAGGGCCCTGTGGTCATGACATCCATGTCCGAGGAGGATGTATCCCTC^CAGCTCTTGTCCTAATA 

GCCCTGAATGAGGGAAAGGAGTTGTGCAGACAGAAGGTAGGACCCAATTTGATGGCGAGCATCGAGAAGGCCGGA 

GGATTGCTTGAGCTCAGGAGGTTGAGGCTGCAGCGGAGCTATGCCGTAGCCATAGCCTCCTATGCACTGGCCGAC 

AAAACCCACT GGCCAGTGGATGAGCAGAAT CTGGGCTCCCT GTACACCATT GAGGCCACAGCCT ATGGGCTCATG 

CAGAAGCTGGAGCTGGGCCGGTACAATGAGACACACGCCATAGCCAAGTGGCTACTAGAGAAGCAGGAGCTGGGA 

GGAGGCTTCAGGTCCACCCAGCCAGGCAGGAGCAGTCGCCTTTCCCAC^ 

NTGGCCCTTGAAGCTCTGACCCGCTTCCXSCGAAGCTOTCCCCTTCAAGGGCATCC^ 

AGAGCCCCCAAGACAGCCCTGAAT GTGAATT GGT ACATTGATCACAGCAATGCCTACCAACAGCGGTCAGCAAAG 
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TT CCTTGCCCAGG ACG ACCTAGAGATCAAAGCCAGTGGCAACGGGAGAGGCACCATCTCGATCCT GACAATGT AT 
CACAAGTCCCC^GAGTCCCGGGAGGACAACTGCAACCTGTACCACCTGAATGCGACTC 

GAAAAT AAAAAGG GAGGT GAGACTTT TCGGC T CC GGATGGAAACAAGG TT CCAGAACAATGGAGAG GCCACAAT G 
ACT AT CATGGAGG T CT CCCTGCTCACGG GCT? CT ACC CCAACCAGGAT GACC TCAAACAGC T CACG AGT GATGT G 
GAGAGCTACGCCTTTCAGTACAAAACCAAGACAAGTACCAGCGACAGCACTGTTGTCCTCTAC^ 
TCCCATGAGAAGAACACGGTGCT<K3GCTTTCGGGTTCAC^ 

GTCACC^TCTACGACTACTACGAGCCTTCCCGGAGGTGCAGCACTTTCTACAACCTGCCCACAGA 
CTGAGAAAGATCT GCCACAAAGACAT CTGCAGAT GTGCAGAGGGACAGTGCCCATCCCTGCAGAAGCCCAGTGGC 
CAATTGAGGCAGGAGGAGCTC CAGACAACAGCAT GTGAGGCAGGCGT GGATTTTGTGT ACAAGACAAAGCTGGAA 
TCTGTGGAGGTCTCTGCCTCCAACCCTTACGTCTATTACAACACGCAGCTC 
GACCCTGCCAAACCCCTGGCCAIXSAAGAAATTCGTC^ 

CAGGAATCGTACCTCATCATGGGCCAGACGTCAGACCTGTGGAGAATCAAATCTGATTA(^GCTATGTTCTGGGC 
AAGGAGAC GTTCCT CAT CCT TT GGCCAGCAGATG GAG AT GCCAGC AAG AAAGAATT GC GGGACCAACT GGAGGAA 
TT?TTGGAATATATGCGCACCCAC<^CTGCCAGTC£T^ AGC^ 
CTCTGGGCCACGTGGGTTT 



The disclosed MOL3 polypeptide (SEQ ID NO:6) encoded by SEQ ID NO:5 is 1612 
amino acid residues, and is presented using the one-letter code in Table 3B. The MOL3 protein 
were analyzed for signal peptide prediction and cellular localization. SignalP results predict that 
5 MOL3 is cleaved between position 20 and 21 of SEQ ID NO:6. Psort and Hydropathy profiles 
also predict that MOL3 contains a signal peptide and is likely to be localized at the endoplasmic 
reticulum (certainty of 0.5500). 



Table 3B. Encoded MOL3 protein sequence (SEQ ID NO:6). 

MSPSLPFSAISLCLSHLPMPLPSPRYILVTPRVI^VGSPE 

ARSQLILSPGNNFMDQAPVTVPESLMYliPKPGQ^WIIR 

PSPLVHYRVFTVNHKJTOPVTRTFTLDIKNP 

TPKQKFEAAFDVKEYVLPSFEVQLVPNCT FGVKLDSCRIPIQSS 

LQRVEVTETGGEMVQAETSGVKI IQSPYNIKPTRTPQYPKPGMPFHFRVRWQSSPIQI IFQSHLSHQATAGFS 

FTLPQI PPQVFI SNPDGSPASRVLVHSQDQKVYTSAEGLATLT INTDANLDKLP IE VKTEESLQPBEQASAKMT 

ATOYLTQDGSGHFIjHIEVKTI^TEVGSSIQLSLNTRHQDPKTKDKITHFT 

RILAFYIiLPRGASQDPELVADSIWIDVNDRCIGL^^ 

YVLNSKmajTQKKVWOTVEEEDIGCTGGSGOT 

RLRSAEIAFLHSSVNKFKTEIjEQKCCEAGLRESPVGLSCEERTWHVRHGPACVAAFI^ 

dlflddmpvrtlfpe swlwnsi shyp is vkvpds ittwqfvwslkagqgglcvsdpfeltvmksffvdlklps 
svirneqvqiqa^ynfiidrqakvrvefph^cetlcsa^ 
kavgygvqdhvkktllvegojysgqtqticl^ 
vptgcpeg/rlssltpviilsryldttgqwgkvgvehrdqvm^ 

VFRVFAIiAYSMMTTQVLSLSSLCDMANWI I IDRQAEDGHFLEKGPWMTSMSEBDVSLTALVIiIAIjNEGKELCR 
QKVG PNLMAS I E KAGX3LLELRRLRLQRS YAVAIASYALADKTHWP VDEQNLGSL YT I EATAYGLMQKLELGR YN 
ETHAIAKWLLEKQELGGGFRSTQPGRSSRLSHPQRWPQGSLXALEAL^ 

VNWYIDHSNAYQQRS AKFLAQDDLE I KA SGNGRGT I S I LTMYHKS PE SREDNCKLYHLNATLHSALEENKKGGB 
TFRLRMETRFQNNGRATMTIMEVSLLTGFYPNQDDLKQLT^ 

VLGFRVHRMLQAEFLQAALVTI YD YYE P S RRCSTFYNLPTE QS SLRKI CHKDI CRCAEG QCPSLQRP S GQLRQE 
ELQTTACEAGVDFVYKTKLESVEVSASNPYVYYNTQLED 1 1 KSGTOPAKPLAMKKFVSHATCHDSLGLQBQESY 
LIMGOTSDLWRIK5DY3YVLGKETFLILWPADGDA5KKEIiRDQLEE?LEYMRTHGCQS 



The full amino acid sequence of the protein of the invention was found to have 257 of 
10 734 amino acids (35 %) identical and 403 of 734 (54%) homolog to a Cavia porcellus (guinea 
pig) complement C3 precursor (contains: C3A anaphylatoxin) (ACC:P12387; 1666 aa), and 255 
of 717 amino acid residues (35 %) identical to, and 401 of 717 residues (55 %) similar to, the 
1663 amino acid residue complement C3 precursor (contains: C3 A anaphylatoxin) from Homo 
sapiens (human) (ACC:P01024). 
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Hie disclosed MOL3 protein (SEQ ED NO:6) also has good identity with a number of 
complement compontent proteins, as shown in Table 3C. 



Table 3C BLAST results for MOL3 


Gene Index/ 
Identifier 


Protein/ 
Organism 


Length 
(aa) 


Identity 
<%) 


Positives 
(%) 


Expect 


gi| 45573851 ref |NP 
J>00055.1| 


complement 
component 3 
precursor [Homo 
sapiens] 


1663 


633/1750 
(36%) 


938/1750 
(53%) 


0.0 


gii 11869931 |gb|AA 
G40565.1IAF154933 
1 (AF154933) 


complement 
component C3 
[Sus scrofa] 


1661 


617/1733 
(35%) 


935/1733 
(53%) 


0.0 


gii 309122 |gb|AAC4 
2013.1) (K02782) 


preprocomplemen 
t component C3 
[Mus musculus] 


1663 


606/1734 
(34%) 


915/1734 
(51%) 


0.0 



5 This information is presented graphically in the multiple sequence alignment given in 

Table 3D (with MOL3 being shown on line 1) as a ChistalW analysis comparing MOL3 with 
related protein sequences. 



10 
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Table 3D, Information for the ClustalW proteins: 

1) Novel MOL3 (SEQ ID NO: 6) 

2) gi |45573B5|refJNP_000055.1| complement component 3 precursor [Homo sapiens] 
(SEQ ID NO:41) 

3) gi|11869931|gb|AAG40565.1|AF154933_l (AF154933) complement component C3 [Sus 
scrofa] (SEQ ID NO: 42) 

4) gi|309122 | gb | AAC42013.il (K02782) prepro complement component C3 [Mus 
musculus] {SEQ ID NO: 43) 



MDL3 Prote 
NP_000055. 
AF154933 
K02782 



MDL3 Prote 
NP_000055 . 
AF154933 

K02782 

130 
I • - • '• I 

M0L3 Prote PTSGSSEttg M 

np_oooo55. p£y 

AP154933 
K02782 



M0L3 Prote 
NPj>00055. 
AF154933 
K02782 



50 60 

| ....|.. ..| 
DSRQPLTRT^ 58 
A RES S) 53 

g BB3 X 53 

A@ £ 55 




250 



260 



270 



280 



290 
.■I • 



300 
..I 
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MOI3 Prote 
NPJ>00055. 
AF154933 
R02782 



KOL3 Prote 
NP 000055. 
AF154933 
K02782 



M0L3 Prote 
NP_000055 . 
AF154933 
K027B2 



KOL3 Prote 
NPJ>00055. 
AF154933 
K02782 



MDL3 Prote 
NPJ>00055. 
AF154933 
R02782 



KDL3 Prote 
NPJ>00055. 
AF154933 
K02782 



MOL3 Prote 
NP_000055. 
AP154933 
K02782 



MDL3 Prote 
NP_000055 . 
AF154933 
K02782 



M0L3 Prote 
NP 000055. 
AF154933 
K02782 



MDL3 Prote 
HP_OO0O55. 
AF154933 
K02782 



MOL3 Prote 
NP_0 00055. 
AF154933 
K02782 



M0L3 Prote 



VLPSFEv 




VLPSFE'v 


JFff 


VLPSFEv 










910 920 930 940 950 

.1 ....|....| ....|.... | | | ....|.... | ....|. 

Wg|l^Kli5 HFSLLB3E^ KVD g fa g ffiflVG fl G— VOgHg g jggL flSGC§Y 910 
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NP_000055. 

AF154933 

K02782 



MOL3 Prote 
NP_000055. 
AP154933 
K02782 



WDL3 Prote 
NP 000055. 
API 54 933 
K02782 



MDL3 Prote 
NP000055 . 
AP154933 
K02782 - 



MOL3 Prote 
NP_000055. 
AF154933 
K02782 



PYVIVPi 
PY^IVPL 
PYVIVPL 




1010 1020 
|.. ..| ....|.... | 
gggDI LGETIVGSSt 953 
;ij@TS BagftSEvS 996 
994 
996 




1124 
1172 
1170 
1172 



1210 




■ I. 



1220 
. . I 



1230 



1240 



1250 
..I 



1260 




LQRgYgVAIA 
LQRgYTVAIft 

lSrBytvaip. 



1270 1280 1290 1300 1310 1320 

....|....| . ... | .... | I I ....|....| J. | 

MOL3 Prote ^^^^^ ^^^^^ A^^^^^ ^^^^ PG RSSRLSHPQR W P^^^A^ 1229 

1330 1340 1350 1360 1370 1380 

4 0 t ^^^^ ^^g^ tnre' gp ^^^^ & D ' 128^ 



M0L3 Prote 
NP_000055 . 
AF154933 
K02782 



MOL3 Prote 
NP_000055. 
AF154933 
K02782 



MOL3 Prote 
NP_000055. 
AF154933 
K02782 



MOL3 Prote 
MP 000055. 
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MOL3 shows significant homologies to human complement C3 proteins, as described in, 
but not limited to, the references below. 
5 It was found that transforming growth factor-betal acts as a potent inhibitor of 

complement C3 biosynthesis in human pancreatic cancer cell lines. Andoh et al. determined 
how transforming growth factor (TGF)-betal affects complement C3 secretion in the pancreatic 
cancer cell lines PANC-1 and BxPC-3. It is suggested that TGF-betal may act as a potent 
inhibitor of C3 secretion in pancreatic cancer cell lines under inflammatory conditions. This 

10 action of TGF-betal did not correlate with NF-kappaB activation, but associated with the 
translocation of Fos protein into the nucleus. 

The cellular localization of complement C3 and C4 transcripts were analyzed in intestinal 
specimens from patients with Crohn's disease. It has been suggested that the increase in C3 and 
C4 levels in jejunal perfusates of patients with Crohn's disease results from local intestinal 

1 5 synthesis of complement. Laufer et al. suggest that there is local regulated production of 

complement in the intestine of patients with CD, and subsequent complement activation may 
contribute to the inflammatory process. 

The generation of complement C3 and expression of cell membrane complement 
inhibitory proteins by human bronchial epithelium cell line. They found that the interrelationship 

20 between human airway epithelium and complement proteins may affect airway defence, airway 
function, and airway epithelial integrity. Local generation of complement C3 and expression of 
cell membrane CEP by human bronchial epithelium and its modulation by proinflammatory 
cytokines might be an additional regulatory mechanism of local airway defence and may affect 
airway function and epithelial integrity in health and disease. 

25 Janssen et al, Am J Kidney Dis 2000 Jan;35(l):21-8 suggested the activation of the acute 

phase response and complement C3 in patients with IgA nephropathy. The authors have shown 
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ystemic complement activation in patients with unmxmoglobulin A (IgA) nephropathy and 
reported that plasma levels of actC3 can indicate disease activity and renal outcome. 

Therapeutic applications 

5 

The expression pattern, and protein similarity information forMOL3 may function as a 
human complement C3-like protein. Therefore, the nucleic acid and protein of the invention are 
useful in potential therapeutic applications implicated, for example but not limited to, cancer, 
lung diseases, including asthma, immundeficiencies, inflammation, Crohn's disease, neurological 
10 disorders, nephropathy, and other diseases and disorders. The homology to antigenic secreted 
and membrane proteins suggests that antibodies directed against the novel genes may be useful 
in treatment and prevention of cancer, lung diseases, including asthma, immundeficiencies, 
inflammation, Crohn's disease, neurological disorders, nephropathy, and other diseases and 
disorders. 

1 5 Potential therapeutic uses for the invention(s) are, for example but not limited to, the 

following; (i) Protein therapeutic, (ii) small molecule drug target, (iii) antibody target 
(therapeutic, diagnostic, drug targeting/cytotoxic antibody), (iv) diagnostic and/or prognostic 
marker, (v) gene therapy (gene delivery/gene ablation), (vi) research tools, and (vii) tissue 
regeneration in vitro and in vivo (regeneration for all these tissues and cell types composing these 

20 tissues and cell types derived from these tissues. 

The nucleic acids and proteins of the invention are useful in potential therapeutic 
applications implicated in cancer, lung diseases, including asthma, immundeficiencies, 
inflammation, Crohn's disease, neurological disorders, nephropathy, and other diseases and 
disorders. For example, but not limited to, a cDNA encoding the human complement C3-like 

25 protein may be useful in gene therapy, and the human complement C3-like protein may be useful 
when administered to a subject in need thereof. By way of non-limiting example, the 
compositions of the present invention will have efficacy for treatment of patients suffering from, 
for example, but not limited to, cancer, lung diseases, including asthma, immundeficiencies, 
inflammation, Crohn's disease, neurological disorders, nephropathy, and other diseases and 

30 disorders. The novel nucleic acid encoding the human complement C3-like protein, and the 

human complement C 3 -like protein of the invention, or fragments thereof, may further be useful 
in diagnostic applications, wherein the presence or amount of the nucleic acid or the protein are 
to be assessed. These materials are further useful in the generation of antibodies that bind 
immunospecifically to the novel substances of the invention for use in therapeutic or diagnostic 

35 methods. 
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MOL4 

The disclosed Wnt 8-like protein, MOL4 (also referred to herein as AC004826), is 
encoded by a nucleic acid, 1064 nucleotides long (SEQ ID NO:7). An open reading frame was 
5 identified beginning with an ATG initiation codon at nucleotides 4-6 and ending with a TGA 
codon at nucleotides 1057-1059. Putative untranslated regions upstream from the initiation 
codon and downstream from the termination codon are underlined in Table 4A, and the start and 
stop codons are in bold letters. The encoded protein having 351 amino acid residues is presented 
using the one-letter code in Table 4B (SEQ ID NO:8). 

10 



Table 4A. MOL4 Nucleotide Sequence (SEQ ID NO:7). 

GCGATGGGGAACCTGTTTATGCTCTGGGCA6CTCT6GGCATATGCTGTGCTGCATTCAGTGCCTCTGCCT6GTCA 
GTGAACAATTTCCTGATAACAGGTC CCAAGGCCTATCTGACCT ACACGACTAGTGTGGCCTT GGGTGCCCAGAGT 
GGCATCGAGGAGTGCAAGTTCCAGTTTGCITCGGAACGCTGGAA 
C^CAACAGGCTGAGAAGTGCTACCAGAGAGACTTCCTTCAT^ 

ATCACCAAGAACTGTAGCTlTGGGTGACTrCGAAAACTGTGGCTGTGATGGGTCAAACAATGGAAAAATAGGAGGC 
CATGGCTGGATCTGGGGAGGCTGCAGCGACAATGTGGAATTTGGGGAAAGGATCTCCAAACTCTTTGTGGACAGT 
TTGGAGAAGGGGAAGGATGCCAGAGCCCTGATGAATCTTCACAACAAC^GGGCCGGCAGACTGGCAGTGAGAGCC 
ACCATGAAAAGGACATGCAAATGTCATGGCATCTCTGGGAGCTGCAGCATACAGACATGCTGGCTGCAGCTGGCT 
GAATTCCGGGAGATGGGAGACTACCTAAAGGCCAAGTATGACCAGGCGCTGAAAATTGAAATGGATAAGCGGCAG 
CTGAGAGCTGGGAACAGCGCCGAGGGCCACTGGGTGCCCGCTGAGGCCTTCCTTCCTAGCGCAGAGGCGGAACTG 
ATCTTTTTAGAGGAATCACCAGATTACTGTACCTGCAATTCCAGCCrGGGCATCTATGGCACAGAGGGTCGTGAG 
TGCCT ACAGAACAGCCACAACACAT CCAGGTGGGAGCGACGTAGCT GTGGGCGCCTGT GC ACTGAGTGTGGGCTG 
CAGGTGGAAGAGAGGAAAACTGAGGTCATAAGCAGCTGTAACTGCAAATTCCAGT GGTG CTG TAC GGT CAAGT GT 
GACCAGTGTAGGCATGTGGT GAGCAAGT ATTACT GCGC ACGCTCCCCAGGCAGT GCCCAGTCCCTGGGT AAGGGC 
AGTGCC TGATAAT A 



The disclosed nucleic acid MOL4 sequence has 881 of 1050 bases (83%) identical to a 
Mus musculus Wnt 8 mRNA (GENEANK-ID: MMWNT8DPT|acc:Z68889) and 637 of 955 
bases (66%) identical to a Homo sapiens Wnt 8 mRNA (GENB ANK-ID: 
15 HSWNT8|acc:Yl 1094). 

The MOL4 polypeptide (SEQ ID NO:8) encoded by SEQ ID NO:7 is presented using the 
one-letter amino acid code in Table 4B. The Psort profile for MOL4 predicts that this sequence 
has a signal peptide and is likely to be localized outside the cell with a certainty of 0.7700. The 
most likely cleavage site for a MOL4 peptide is between amino acids 24 and 25 based on the 
20 SignalP result. 



Table 4B. MOL4 protein sequence (SEQ ID NO:8) 

MGNLFMLWAALGICCAAFSASAWSVNNFLITGPK^ 
RLRSATRETSFIHAISSAGVMYIITKHCSMGDFENCGCTC 
GKDARALMNLHNNRAGPXAVRATMKRTCKCRG 
NSAEGHWVPAEAFLPSAEAELlFIiEESPDYCTCNSSLGIY 

ICTSVISSC^CKFQWCCTVKCDQCRHVVSKYYCARSPGSAQSLGKGSA 
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The full amino acid sequence of the disclosed MOL4 polypeptide has 282 of 349 amino 
acid residues (80%) identical to, and 306 of 349 residues (87%) positive with, the 354 amino 
acid residue WNT-8D protein from Mils musculus (ptnr:SPTREMBL-ACC: Q64527), (E = 
1.5xl0" 15 ) 

5 BLASTP (Non-Redundant Composite database) analysis of the best hits for alignments 

with MOL4 are listed in Table 4C. 



Table 4C. BLASTP results for MOL4 


Gene Index/ 
Identifier 


Protein/ Organism 


Length 
taa) 


Identity 
(%) 


Positives 
(%) 


Expect 


gi 1116930461 
gb ] AAG3B662 . 
11 

(AY009402) 


WNT8d precursor [Homo 
sapiens] 


355 


335/348 
(96%) 


336/348 
(96%) 


0.0 


gi|6678169|r 
ef |MP 033316 
-11 


stimulated by retinoic 
acid gene 11 [Mus 
musculus] 


354 


271/349 
(77%) 


295/349 
(83%) 


l.Oe- 
148 


gi|104264-|pi 
r| IS18771 


developmental 
regulator Xwnt-8 - 
African clawed frog 


387 


246/335 
(73%) 


285/335 
(84%) 


l.Oe- 
136 


gi|1722844|s 
p|P51030|WN8 
C CHICK 


WNT-8C PROTEIN 
PRECURSOR (CWNT-8) 


357 


242/337 
(71%) 


283/337 
(83%) 


l.Oe- 
134 



1 0 This information is presented graphically in the multiple sequence alignment given in 

Table 4D (with MOL4 being shown on line 1) as a ClustalW analysis comparing MOL4 with 
related sequences. 

Table 4D Information for the ClustalW proteins: 



15 



20 



1) MOL4 (SBQ ID NO: 8) 

2) gi|11693046|gb|AAG38662.U (AY009402) WNT8d precursor [Homo sapiens] (SEQ 
ID N0:44) 

3) gi|6678169|ref |NP_033316. 1 1 stimulated by retinoic acid gene 11 [Mus 
musculus] (SEQ ID NO: 45) 

4) gi| 104264 |pir| IS18771 developmental regulator Xwnt-8 - African clawed frog 
(SEQ ID NO: 46) 

5) gi|1722844|sp|P51030|WN8C_CHICK WNT-8C PROTEIN PRECURSOR (CWNT-8) (SEQ ID 
NO:47) 



25 



30 



35 



MOIt4 Pro to 

AY009402 

NPJJ33316. 

S18771 

P51030 



MOL4 Prote 
A7009402 
NP 033316. 
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10 



15 



20 



25 



30 



35 



40 




S18771 
P51030 



M0L4 Pro to 

AY009402 

NP_033316. 

S18771 

P51030 



MOL4 Prota 

AY009402 

NP_033316. 

S18771 

P51030 



MOL4 Prota 

AY009402 

NP_033316. 

S18771 

P51030 



MOL4 Prote 

AY009402 

NP_033316. 

S18771 

P51030 



370 380 

I J I • - 

MOL4 Prote 351 

AY009402 I 355 

NP_033316. 354 

S18771 AFQKMSPTST RASGPVCLQD FETNLWT 3B7 

P51030 .357 



45 WNT genes encode intercellular signaling glycoproteins that play important roles in key 

processes of embryonic development such as mesoderm induction, specification of the 
embryonic axis, and patterning of the central nervous system, spinal cord, and limbs. The name 
WNT denotes the relationship of this family to the Drosophila segment polarity gene 'wingless, 1 
and to its vertebrate ortholog Intl , a mouse protooncogene; see WNT1 . It was noted that multiple 

50 WNT genes are known to exist in several species that have been investigated ranging from 

Drosophila to man. They have been classified into various groups and subgroups on the basis of 
high sequence homology and common expression patterns. The vertebrate WNT8 subfamily 
includes genes ftomXenopus> zebrafish, and chicken; The first mammalian WNT8 homolog, a 
human member of the Wnt8 family that they termed WNT8B was characterized on the basis of 

55 the very high sequence similarity (90-91% identity) of the inferred protein to those encoded by 
the Xenopus and zebrafish Wnt8b genes. The human cDNA encodes a 295-amino acid 
polypeptide that contains a C2H2 zinc finger-like motif. A predominant 1 .9-kb mRNA was 
detected in a variety of adult and fetal tissues. They used PCR typing of a human 
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monochromosomal hybrid cell panel to map the gene to chromosome 10, and fluorescence in situ 
hybridization for localization at 10q24. 

The full-length cDNA sequence and genomic organization of the human WNT8B gene 
was presented and reported studies of expression of the gene in human and mouse embryos. The 
5 WNT8B gene contains six exons separated by small introns, with the exception of intron 1 . The 
predicted protein has 351 amino acids. The gene is expressed predoininantly as a transcript of 
approximately 2.1 kb. The human and mouse expression patterns appeared to be identical and 
were restricted to the developing brain, with the great majority of expression being found in the 
developing forebrain. In the latter case, expression was confined to the germinative 
10 neuroepithelium of three sharply delimited regions: the dorsomedial wall of the telencephalic 
ventricles (which includes the developing hippocampus), a discrete region of the dorsal 
thalamus, and the mammillary and retromammary regions of the posterior hypothalamus. 
Expression in the developing hippocampus may suggest a role for WNT8B in patterning of this 
region, and subchromosomal localization of the human gene to 10q24 may suggest it as a 
15 candidate gene for partial epilepsy (EPT; OMM-600512) in families in which the disease has 
been linked to markers in this region. 

WNT1 is a member of a family of cysteine-rich, glycosylated signaling proteins that 
mediate diverse developmental processes such as the control of cell proliferation, adhesion, cell 
polarity, and the establishment of cell fates. Wntl was identified as an oncogene activated by the 
20 insertion of mouse mammary tumor virus in virus-induced mammary adenocarcinomas. 

Although Wntl is not expressed in the normal mammary gland, expression of Wntl in transgenic 
mice causes mammary tumors. To identify downstream genes in the WNT signaling pathway 
that are relevant to the transformed cell phenotype, A PCR-based cDNA subtraction strategy was 
used, suppression subtractive hybridization. It was reported that the identification of two genes, 
25 WISP1 and WISP2, that are upregulated in the mouse mammary epithelial cell line transformed 
by Wntl, but not by Wnt4. Together with a third related gene, WISP3, these proteins define a 
subfamily of the connective tissue growth factor family. Two distinct systems demonstrated 
WISP induction to be associated with the expression of WNTl. WISP1 genomic DNA was 
amplified in colon cancer cell lines and in human colon tumors and its RNA overexpressed in 
30 84% of the tumors examined compared with patient-matched normal mucosa. WISP3 also was 
overexpressed in 63% of colon tumors analyzed. In contrast, WISP2 showed reduced RNA 
expression in 79% of the tumors. These results suggested that WISP genes may be downstream 
of WNTl signaling and that aberrant levels of WISP expression in colon cancer may play a role 
in colon tumorigenesis. 
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It was found that the WISP1 cDNA encodes a 367-amino acid protein. Mouse and human 
WISP1 proteins are 84% identical; both have hydrophobic N-terminal signal sequences, 38 
conserved cysteine residues, and 4 potential N-linked gjycosylation sites. Alignment of the three 
human WISP proteins showed that WISP1 and WISP3 are most similar (42%), whereas WISP2 
5 had 37% identity with WISP1 and 32% identity with WISP3. 

Uses of the Compositions of the Invention 

The above defined information for this invention suggests that MOL4 may function as a 
member of the "Wnt 8 family". Therefore, the novel nucleic acids and proteins identified here 

10 may be useful in potential therapeutic applications implicated in (but not limited to) various 
pathologies and disorders as indicated below. The potential therapeutic applications for this 
invention include, but are not limited to: protein therapeutic, small molecule drug target, 
antibody target (therapeutic, diagnostic, drug targetmg/cytotoxic antibody), diagnostic and/or 
prognostic marker, gene therapy (gene delivery/gene ablation), research tools, tissue regeneration 

15 in vivo and in vitro of all tissues and cell types composing (but not limited to) those defined here. 
The nucleic acids and proteins of the invention are useful in potential therapeutic 
applications implicated in neurodegenerative disorders, epilepsy, cancers including but not 
limited to brain tumor, colon cancer and breast cancer, developmental disorders, neural tube 
defects, and/or other pathologies and disorders. For example, a cDNA encoding the Wnt 8-like 

20 protein may be useful in gene therapy, and the Wnt 8-like protein may be useful when 

administered to a subject in need thereof. By way of nonlimiting example, the compositions of 
the present invention will have efficacy for treatment of patients suffering from 
neurodegenerative disorders, epilepsy, cancers including but not limited to brain tumor, colon 
cancer and breast cancer, developmental disorders, and neural tube defects,. The novel nucleic 

25 acid encoding Wnt 8-like protein, and the Wnt 8-like protein of the invention, or fragments 

thereof, may further be useful in diagnostic applications, wherein the presence or amount of the 
nucleic acid or the protein are to be assessed. These materials are further useful in the generation 
of antibodies that bind immunospecifically to the novel substances of the invention for use in 
therapeutic or diagnostic methods. 

30 

MOL5 

The disclosed novel Beta Thymosin-like MOL5 nucleic acid of 215 nucleotides (also 
referred to as AC025535) is shown in Table 5A. An ORP begins with an ATG initiation codon at 
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nucleotides 4-7 and ends with a TGA codon at nucleotides 211-213. A putative untranslated 
region upstream from the initiation codon and downstream from the termination codon is 
underlined in Table 5A, and the start and stop codons are in bold letters. 



Table 5A. MOL5 Nucleotide Sequence (SEQ ID NO:9) 

AGT AT GGT CT CAG C C CAG CGTTTCA CGAGT CTT CAAG C CTT CAGGCTTT C TTTAATCAAGATGAGT GAT AAC CC 
AAACT TGT CAGAAGTGAAG TTTGACAGGTCAAAATTGAAGAAAA CTAACACTGGAGAAAAAAATAGGCTTT CTT 
CCAAGGAAACTATCCAGCAGGAGAAATACGGTGTTCAAACATCATATAATGGGC^OT 



The MOL protein encoded by SEQ ID NO:9 has 69 amino acid residues and is presented 
using the one-letter code in Table 5B. The Psort profile for MOL5 predicts that this sequence 
has a signal peptide and is likely to be localized at the mitochondrial intermembrane space with a 
certainty of 0.8800. Using the SIGNALP analysis, the protein of the invention does not appear to 
1 0 contain a predictable signal peptide. 



Table 5B. Encoded MOL5 protein sequence (SEQ ID NO:10) 

MVSAQRFTSLQAFRLSLIKMSDNPNIiSEVKFDRSKLKKTNTQEK^ 



The disclosed nucleic acid sequence for MOL5 hasl67 of 191 bases (87%) identical to a 
Homo sapiens Beta ThymosmmRNA (GENBANK-ID: D82345|acc:D82345) (E= 5.1e" 26 ). 
1 5 The full MOL5 amino acid sequence has 37 of 45 amino acid residues (82%) identical to, 

and 38 of 45 residues (84%) positive with, the 45 amino acid residue Thymosin beta protein 
from Homo sapiens (ptnr: PIR-ID: JC5274) (E= 1 .2e* ! ! ). 

MOL5 also has homology to other proteins as shown in BLAST alignment results in 



Table 5C. 



Table 5C. BLAST results for MOL5 


Gene Index/ 
Identifier 


Protein/ Organism 


Length 
(aa) 


Identity 
(%) 


Positives 
(%) 


Expect 


ref |NPJ)68B32.1| 


thymosin, beta, 
identified in 
neuroblastoma 
cells [Homo 
sapiens] 


45 


37/45 
(82%) 


38/45 
(84%) 


5e-06 


pirl | 152084 


thymosin beta-4 
precursor - rat 
(fragment) 


56 


27/39 
(69%) 


33/39 
(84%) 


2e-04 


sp|P200651TYB4 MOUS 
E 


THYMOSIN BETA-4 


50 


27/39 
(69%) 


33/39 
(84%) 


3e-04 


gb 1AAA36746.1I 
(M92383) 


thymosin beta-10 
[Homo sapiens] 


49 


24/40 
(60%) 


32/40 
(80%) 


0.002 


gb IAAB37101.1I 
(025684) 


thymosin beta- 
like protein 
[Rattus 
norvegicus] 


45 


31/39 
(79%) 


34/39 
(86%) 


0.002 
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This information is presented graphically in the multiple sequence alignment given in 
Table 5D (with MOL5 being shown on line 1) as a ClustalW analysis comparing MOL5 with 
related protein sequences. 

5 



Table 5D Information for the ClustalW proteins: 



1) MOL5 {SEQ ID NO: 10) 

2) ref iNP_068832.1| thymosin, beta, identified in neuroblastoma cells [Homo 
10 sapiens] (SEQ ID NO: 48) 

3) pir||I52084 thymosin beta-4 precursor - rat (fragment) (SEQ ID N0:49) 

4) sp|P20065|TYB4_MOUSE THYMOSIN BETA-4 (SEQ ID N0:50) 

5) gb IAAA36746.il (M92383) thymosin beta-10 [Homo sapiens] (SEQ ID N0:51) 

6) gb IAAB37101.il (U25684) thymosin beta-like protein [Rattus norvegicus] (SEQ 
15 ID N0:52) 




70 

....|....| 

MOL5 Prote VQTSYNGGffA 69 

NP_068832. VQTS 45 

152084 G£3 56 

P2006S GBS 50 

M92383 EIS 49 

U25684 NQRS 45 



Thymosin-beta-4 induces the expression of terminal deoxynucleotidyl transferase activity 
20 in vivo and in vitro, inhibits the migration of macrophages, and stimulates the secretion of 
hypothalamic luteinizing hormone-releasing hormone. It was noted that the protein was 
originally isolated from a partially purified extract of calf thymus, thymosin fraction 5, which 
induced differentiation of T cells and was partially effective in some immunocompromised 
animals. Further studies demonstrated that the molecule is ubiquitous; it had been found in all 
25 tissues and cell lines analyzed It is found in highest concentrations in spleen, thymus, lung, and 
peritoneal macrophages. It was stated that thymosin-beta-4 is an actin monomer sequestering 
protein that may have a critical role in modulating the dynamics of actin polymerization and 
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depolymerization in nonmuscle cells. Its regulatory role is consistent with the many examples of 
transcriptional regulation of T-beta-4 and of tissue-specific expression. Lymphocytes have a 
unique T-beta-4 transcript relative to tiie ubiquitous transcript found in many other tissues and 
cells. It was stated that rat thymosin-beta-4 is synthesized as a 44 -amino acid propeptide which is 
5 processed into a 43-amino acid peptide by removal of the first methionyl residue. The molecule 
does not have a signal peptide. Human thymosin-beta-4 has a high degree of homology to rat 
thymosin-beta-4; the coding regions differ by only 9 nucleotides, and these are all silent base 
changes. 

By differential screening of a cDNA library prepared from leukocytes of an acute 
10 lymphocytic leukemia patient, a cDNA encoding thymosin-beta-4 was isolated. Using Northern 
blot analysis, the expression of the 830-nucleotide thymosin-beta-4 mRNA in various primary 
myeloid and lymphoid malignant cell lines and in hemopoietic cell lines was studied. It was 
stated that the pattern of thymosin-beta-4 gene expression suggests that it may be involved in an 
early phase of the host defense mechanism. 

15 A cDNA clone for the human interferon-inducible gene 6-26 was isolated and showed 

that its sequence was identical to that for the human thymosin-beta-4 gene. By use of a panel of 
human rodent somatic cell hybrids, it was shown that the 6-26 cDNA recognized seven genes, 
members of a multigene family, present on chromosomes 1, 2, 4, 9, 1 1, 20, and X. These genes 
are symbolized TMSL1, TMSL2, etc., respectively. Li et al. (1996) established that in the mouse 

20 there is a single Tmsb4 gene and that the lymphoid-specific transcript is generated by extending 
the ubiquitous exon 1 with an alternate downstream splice site. By interspecific backcross 
mapping, they located the mouse gene, which they symbolized Ptmb4, to the distal region of the 
mouse X chromosome, linked to Btk and Gja6. Thus, the human gene could be predicted to 
reside on the X chromosome in the general region of Xq21 .3-q22, where BTK is located By 

25 analysis of somatic cell hybrids, the thymosin-beta-4, or TB4X, gene were mapped to the X 
chromosome. They noted that a homologous gene, TB4Y, is present on the Y chromosome. 

It was stated that prostate carcinoma is the most prevalent form of cancer in males and 
the second leading cause of cancer death among older males. The use of the serum prostate- 
specific antigen test permits early detection of human prostate cancer; however, early detection 
30 has not been accompanied by an improvement in determining which tumors may progress to the 
metastatic stage. The process of tumor metastasis is a multistage event involving local invasion 
and destruction of extracellular matrix; intravasation into blood vessels, lymphatics or other 
channels of transport; survival in the circulation; extravasation out of the vessels into the 
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secondary site; and growth in the new location. Common to many components of the metastatic 
process is the requirement for tumor cell motility. A well-characterized series of cell lines that 
showed varying metastatic potential was developed from the Dunning rat prostate carcinoma. A 
direct correlation between cell motility and metastatic potential in the Dunning cell lines was 
5 shown. In studies comparing gene expression in poorly and highly motile metastatic cell lines 
derived from Dunning rat prostate carcinoma using differential mRNA display, Bao et al. (1996) 
found a novel member of the thymosin-beta family of actin-binding molecules. The molecule, 
named thymosin-beta- 15 by them, was found to deregulate motility in prostate cells directly. In 
addition, it was expressed in advanced human prostate cancer specimens, but not in normal 

10 human prostate or benign prostatic hyperplasia, suggesting its potential use as a new marker for 
prostate carcinoma progression. Bao et al. (1996) found that thymosin-beta-15 levels correlated 
positively with the Gleason tumor grade. Coffey (1996) pointed out that the upregulation of 
thymosin-beta-15 as a positive motility factor and the down regulation of the motility suppressor 
KAI1 (OMIM- 600623) provide the 'yin and yang' for metastasis; he speculated that these 

15 pathways may provide a new target for therapy. 

Angiogenesis is an essential step in the repair process that occurs after injury. In a study, 
the angiogenic thymic peptide thymosin beta4 (Tbeta4) enhanced wound healing in a rat full 
thickness wound model was examined. Addition of Tbeta4 topically or intraperitoneally 
increased reepithelialization by 42% over saline controls at 4 d and by as much as 61% at 7 d 

20 post-wounding. Treated wounds also contracted at least 1 1% more than controls by day 7. 

Increased collagen deposition and angiogenesis were observed in the treated wounds. We also 
found that Tbeta4 stimulated keratinocyte migration in the Boyden chamber assay. After 4-5 h, 
migration was stimulated 2-3-fold over migration with medium alone when as little as 10 pg of 
Tbeta4 was added to the assay. These results suggest that Tbeta4 is a potent wound healing 

25 factor with multiple activities that may be useful in the clinic. 

Uses of the Compositions of the Invention 

The above defined information for this invention suggests thatMOLS may function as a 
member of a "Beta Thymosin family". Therefore, the novel nucleic acids and proteins identified 
30 here may be useful in potential therapeutic applications implicated in (but not limited to) various 
pathologies and disorders as indicated below. The potential therapeutic applications for this 
invention include, but are not limited to: protein therapeutic, small molecule drug target, 
antibody target (therapeutic, diagnostic, drug targeting/cytotoxic antibody), diagnostic and/or 
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prognostic marker, gene therapy (gene deliveiy/gene ablation), research tools, tissue regeneration 
in vivo and in vitro of all tissues and cell types composing (but not limited to) those defined here. 

The nucleic acids and proteins of the invention are useful in potential therapeutic 
applications implicated in cancer including but not limited to prostate cancer, immunological 
5 and autoimmune disorders {i.e., hyperthyroidism), angiogenesis and wound healing, modulation 
of apoptosis, neurodegenerative and neuropsychiatry disorders, age-related disorders, and other 
pathological disorders involving spleen, thymus, lung, and peritoneal macrophages and/or other 
pathologies and disorders. For example, a cDNA encoding the Beta Thymosin-like protein may 
be useful in gene therapy, and the Beta Thymosin-like protein may be useful when administered 

10 to a subject in need thereof. By way of nonlimiting example, the compositions of the present 
invention will have efficacy for treatment of patients suffering from cancer including but not 
limited to prostate cancer, immunological and autoimmune disorders (Le., hyperthyroidism), 
angiogenesis and wound healing, modulation of apoptosis, neurodegenerative and 
neuropsychiatric disorders, age-related disorders, and other pathological disorders involving 

15 spleen, thymus, lung, and peritoneal macrophages. The novel nucleic acid encoding Beta Thymosin- 
like protein, and the Beta Thymosin-like protein of the invention, or fragments thereof, may 
further be useful in diagnostic applications, wherein the presence or amount of the nucleic acid 
or the protein are to be assessed. These materials are further useful in the generation of 
antibodies mat bind immunospecifically to the novel substances of the invention for use in 

20 therapeutic or diagnostic methods. 

MOL6 

MOL6a 

The disclosed novel Trypsin-like MOL6a nucleic acid of 730 nucleotides (also referred to 
as GM_87760758_A) is shown in Table 6 A. An open reading begins with an ATG initiation 
25 codon at nucleotides 8-10 and ends with a TGA codon at nucleotides 713-715. A putative 

untranslated region upstream from the initiation codon and downstream from the termination 
codon are underlined in Table 6A, and the start and stop codons are in bold letters. 



Table 6A. MOL6a Nucleotide Sequence (SEQ ID NO:ll) 

GATC^CCA T<3^^TaTGTCTTCTATTTGGGTGTCCTCGCTGG6RCATTTTTC?TTGCTGACTCATCTGTTCA6^ 
AGAAGACCCTGCTCCCTATTTGGT GT ACCTCAAGTCT CACTT CAACCCCTGTGTGGGCGTCCTCATCAAACCCAG 
CTGGGTGCTGGCCCCAGCTCACTGCTATTTACCAAATCTGAAA 

AGACGGT ACT GAACAG AC AAT T AACC CCATTCAGAT C GT C CG CT ACTGGAACT ACAGT CATAGC GCCCC ACAGG A 

IHSACCTCATGCTCATCAAGCTGGCTAAGCCTGCCATGCTGAATCCCAAAGTCC^ 

CAATGTCAGGCCAGGCACTGTCTGTCTACTCTCAGGTTTGGACTGG 

CTTGCGGCAGAACCTG GAGGC CC C CGTGATGTCT GATCGAGAAT G CCAAAAAACAGAACAAGGAAAAAGCCACAG 
GAATTCCTTATGTGTCAAATTTGTGAAAGTATTCAGCCGAATTTTTGGGG 
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CAAAGACAAGCTCCAGGGAATCGAGGTGGGGCACTTCATGGG^ 

ATATGTATCC TGGAT TGAGAACAC TG CTAAGGAC AAG TGAGACC C TACTTCTC CC 



Hie disclosed nucleic acid sequence has 354 of 581 bases (60%) identical to &Mus 
musculus prepro-Trypsininogen mRNA (GENBANK-ID: MMTRYARjacc:X04574) (E value = 
9.9e- 24 ). 

5 The MOL6a protein encoded by SEQ ID NO: 1 1 has 235 amino acid residues, and is 

presented using the one-letter code in Table 6B (SEQ ID NO: 12). The Psort profile for MOL6a 
predicts that this sequence has a signal peptide and is likely to be localized on the outside with a 
certainty of 0.3700. The most likely cleavage site for a peptide is between amino acids 19 and 
20, ADS-SV based on the SignalP result 

10 



Table 6B. Encoded MOL6a protein sequence (SEQ ID NO: 12). 

MKYV FYLGVIAGT F FFADSS VQKEDPAP YLVYLKSH FN PCVGVLI KPSWVLAPAHCYLPNLKVMLGNFKS RVRDGT 
EOTINPIQIVRYWNYSHSAPQDDLMLIKLAKPA 

L EAPVM S DRECQKTEQGKSHRN S LCVKFVKVFS RI FGE VAVATVICKDKLQG I EVGH FMGG DVG I YTNVYKYVS W I 
ENTAKDK 



The full amino acid sequence of MOL6a was found to have 79 of 208 amino acid 
residues (37%) identical to, and 118 of 208 residues (56%) positive with, the 248 amino acid 
residue TRIPSINOGEN I-Pl PRECURSOR (EC 3.4.21.4) protein from Gallus gallus (ptnr: 
15 SWISSNEW-ACC: Q90627) (E value = Lie- 33 ). 

MOL6 also has high homology to the proteins shown in the BLAST data in Table 6C. 



SNP analysis of MOL6a is described in Example 2. 



Table 6C. BLAST results for MOL6a 


Gene Index/ 
Identifier 


Protein/ Organism 


Length 
(aa) 


Identity 
(%) 


Positives 
(%) 


Expect 


gi|2499B62|sp|Q9062 
7 1 TRYl^CHICK 


TRYPSIN I-Pl 
PRECURSOR 


248 


79/208 
(37%) 


118/208 
(55%) 


2e-31 


gi|2118087|pir| |S55 
067 


trypsin (EC 
3.4.21.4) X 
precursor, 
pancreatic - 
chicken 


248 


78/20B 
(37%) 


117/208 
(55%) 


5e-31 


gi|6678439|ref INP 0 
33456.11 


trypsin 2 [Mus 
musculus] 


246 


77/215 
(35%) 


115/215 
(52%) 


8e-29 


gil 1633123 |pdb|lSLW 
IB 


Chain B, Rat 
Anionic Nl43h, 
E151h Trypsin 
Complexed To A86h 
Ecotin 


223 


74/212 
(34%) 


116/212 
(53%) 


le-28 



MOL6b 

20 In the present invention, the target sequence identified previously, MOL6a, Accession 

Number GM_87760758_A, was subjected to the exon linking process to confirm the sequence. 
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PCR primers were designed by starting at the most upstream sequence available, for the forward 
primer, and at the most downstream sequence available for the reverse primer. In each case, the 
sequence was examined, walking inward from the respective termini toward the coding 
sequence, until a suitable sequence that is either unique or highly selective was encountered, or, 

5 in the case of the reverse primer, until the stop codon was reached. Such suitable sequences 
were then employed as the forward and reverse primers in a PCR amplification based on library 
containing a wide range of cDNA species. The resulting amplicon was gel purified, cloned and 
sequenced to high redundancy to provide the sequence reported below, which is designated 
MOL6b, Accession Number GM_87760758_A_da 

10 The disclosed novel Trypsin-like MOL6b nucleic acid of 730 nucleotides (also referred 

to as GM_87760758_A_da) is shown in Table 6D. An open reading frame begins with an ATG 
initiation codon at nucleotides 8-10 and ends with a TGA codon at nucleotides 713-715. A 
putative untranslated region upstream from the initiation codon and downstream from the 
termination codon are underlined in Table 6A, and the start and stop codons are in bold letters. 

15 



Table 6D. MOL6b Nucleotide Sequence (SEQ ID NO:13) 

C^TCACCATGAAATATGTCTTCTATTT^ 
AAGAAGACCCTGCTCCCTATTTGGTGTACCTCAAGTCTCAC 

AG CTGGGTGC TGGCC C CAG CT CACTG CT ATT TAC CAAAT CT GAAAGTGATGCTGGGAAAT TT CAAGAGCAGAGT 

CAGAGACGGTACTGAACAGACAATTAACCCC^TTCAGATCGrcCGCTACTGGAACTACAGTCATAGCGCCCCA^ 

AGGATGACCTCATGCTCATCAAGCT<^CTAAGCCTGC 

ACCACCAATGTCAGGCCAGGGACTGTCTGTCTACTCTCAGGTTTGGACTGG 

CCCTGACTTGCGGCAGAACCTGGAGGCCCCCGTGATGTCTGATCGAGAATGC 

GCCACAGGLAATTCCTTATGTGTGAAATTTGTGAAAGTATTCA^ 

CTCATCTGCAAAGACAAGCTCCAGGGAATCGAGGTGGGGCAOT 

TG TTT ACAAATATGTATC C TGGATTGAGAACA CTGCTAAGGACAAGTGAGAC C CT ACT TCTCC C 

The MOL6b protein encoded by SEQ ID NO: 13 has 235 amino acid residues, and is 
20 presented using the one-letter code in Table 6E (SEQ ID NO: 14). The Psort profile for MOL6a 
predicts that this sequence has a signal peptide and is likely to be localized on the outside with a 
certainty of 0.3700. The most likely cleavage site for a peptide is between arnino acids 19 and 
20 based on the SignalP result. 



Table 6E. Encoded MOL6b protein sequence (SEQ ID NO:14). 

MKYVFYLGVLAC^ 

EQTINPIQIVRYl^YSHSAPQDDLMLIKLA^ 
LEAPVMSDRECQKTEQGKSHRNSLCVKFVKVF^ 

ENTAKDK 



45 



WO 01/81578 



PCT/US01/13578 



The full amino acid sequence of M0L6b was found to have homology with several 
proteins including those disclosed in the BLASTP data in Table 6F. 



Table 6F. BLAST results for MOLfflb 


Gene Index/ 
Identifier 


Protein/ Organism 


Length 
(as) 


Identity 
(%) 


Positives 
(%) 


Expect 


ACC:Q90627 


TRYPSIN I-Pl 
PRECURSOR (EC 
3.4.21.4) - 
Gallus gallus 
(Chicken) 


248 


79/208 
(37%) 


118/208 
(56%) 


3.8e- 
34 


PIR-ID:S55067 


trypsin (EC 
3.4.21.4) I 
precursor, 
pancreatic - 
chicken 


248 


78/208 
(37%) 


117/208 
(55%) 


7.9e- 
34 


ACC:Q90628 


TRYPSIN I-P38 
PRECURSOR (EC 
3.4.21.4) - 
Gallus gallus 
(Chicken) 


248 


78/208 
(37%) 


117/208 
(56%) 


l.Oe- 

33 


ACC:P07477 


TRYPSIN I 
PRECURSOR (EC 
3.4.21.4) 
{CATIONIC 
TRYPSINOGEN) - 
Homo sapiens 
(Human) 


247 


76/212 
(35%) 


112/212 
(52%) 


1.3e- 
31 



5 

MOL6b also has high homology to the proteins shown in the BLASTX alignment data in 
Table 6G. 



table 6G. BLASTX results for MOL6b 






Smallest 








Sum 




Reading 


High 


Prob 




Sequences producing High-scoring Segment Pairs: Frame 


Score 


P(N) 


N 


ptnr:SWISSPROT-ACC:Q90627 TRYPSIN I-Pl PRECURSOR (EC 3. +2 


372 


2.3e-33 


1 


ptnr:PIR-ID:S55067 trypsin (EC 3.4.21.4} I precursor, .+2 


369 


4.7e-33 


1 


ptnr:SWISSPROT-ACC:Q90628 TRYPSIN I-P38 PRECURSOR (EC .+2 


368 


6.0e-33 


1 


ptnr:SWISSPROT-ACC:P07146 TRYPSIN II, ANIONIC FRECURSO. +2 


350 


4 . 9e-31 


1 


ptnr:SWISSPROT-ACC:P07477 TRYPSIN I PRECURSOR (EC 3. 4.. +2 


348 


7.9e-31 


1 


ptnr : SPTREMBL-ACC : Q9R0T7 PANCREATIC TRYPSIN - Mus muse. +2 


348 


7.9e-31 


1 


ptnr:SWISSPROT-ACC:P15951 TRYPSIN III PRECURSOR (EC 3.. +2 


347 


i.0e-30 


1 



This information is presented graphically in the multiple sequence alignment given in 
10 Table 6H (with MOL6a being shown on line 1 and MOL6b being shown on line 2) as a 
ClustalW analysis comparing MOL6 with related protein sequences. 

Table 6H Information for the ClustalW proteins: 
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1) MOL6a (SEQ ID NO: 12) 

2) MOL6b (SEQ ID NO: 14) 

3) gi 124998 62 |sp|Q90 627 |TRY1_CHICK TRYPSIN I-Pl PRECURSOR (SEQ ID NO: 53) 

4) gi|2U8087|pir| [S55067 trypsin (EC 3.4.21.4) I precursor, pancreatic - 
chicken (SEQ ID NO: 54) 

5) gi|6678439|ref |NP_033456.1 j trypsin 2 [Mus rausculus] (SEQ ID NO:55) 

6) gi|l633123|pdb|lSLW|B Chain B, Rat Anionic N143h, E151h Trypsin Complexed To 
A86h Ecotin (SEQ ID NO: 56) 




QKTEQ 




3 170 


QKTEQ 




R 170 


ssayp 




R 180 


SSRYP 




R 180 


EASYP 




D 17B 


PDGKKE, 


R 94 



227 
227 
240 
240 
238 
135 




235 
235 
SN 248 
[SSN 248 
246 

DMAWR- 142 



MOL6b also contained several single polynucleotide polymorphisms described in Table 

61. 



Table 61. SNP for MOL6b 


Position 


Nucleotide Change 


Number of Occurrences 


70 


OA 


2 


70 


OG 


6 


261 


T>C 


2 


406 


A>C 


2 
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573 


G>T 


2 


585 


OT 


2 


737 


A>G 


4 



Trypsin (EC 3.4.21.4 ), like elastase, is a member of the pancreatic family of serine 
proteases. The gene encoding trypsin-1 (TRY1) is also referred to as serine protease- 1 (PRSS1). 
MacDonald et al. (1982) reported nucleotide sequences of cDNAs representing 2 pancreatic rat 
5 trypsinogens. Using a rat cDNA probe, Honey et al. (1984, 1984) found that a 3.8~kb DNA 
fragment containing human trypsin-1 gene sequences cosegregated with chromosome 7, and 
assigned the gene further to 7q22-7qter by study of hybrids with a deletion of this segment. The 
trypsin gene is on mouse chromosome 6 (Honey et al., 1984). Carboxypeptidase A and trypsin 
are a syntenic pair conserved in mouse and man. Erni et al. (1986) isolated cDNA clones for two 

10 major human trypsinogen isozymes from a pancreatic cDNA library. The deduced amino acid 
sequences had 89% homology and the same number of amino acids (247), including a 15-amino 
acid signal peptide and an 8-amino acid activation peptide. Southern blot analysis of human 
genomic DNA with the cloned cDNA as a probe showed that the human trypsinogen genes 
constitute a family of more than ten, some of which may be pseudogenes or may be expressed in 

1 5 other stages of development. 

Rowen et al. (1 996) found that there are eight trypsinogen genes embedded in the beta T 
cell receptor locus or cluster of genes (TCRB; OMM- 186930), which maps to 7q35. In the 685- 
kb DNA segment that they sequenced they found five tandemly arrayed 10-kb locus-specific 
repeats (homology units) at the 3-prime end of the locus. These repeats exhibited 90 to 91% 

20 overall nucleotide similarity, and embedded within each is a trypsinogen gene. Alignment of 

pancreatic trypsinogen cDNAs with the germline sequences showed that these trypsinogen genes 
contain five exons that span approximately 3.6 kb. Further analyses revealed 2 trypsinogen 
pseudogenes and one relic trypsinogen gene at the 5-prime end of the sequence, all in inverted 
transcriptional orientation. They denoted eight trypsinogen genes Tl through T8 from 5-prime to 

25 3-prime. Rowen et al. (1 996) found that only two of three pancreatically expressed trypsinogen 
cDNAs correspond to trypsinogen genes in the TCRB locus; T4 was denoted trypsinogen 1 and 
T8 was denoted trypsinogen 2 (OMIM- 601564). The third pancreatic cDNA, identified 
independently as trypsinogen 3 (Tani et al., 1990) and 4 (Wiegand et al., 1993), is distinct from 
the third apparently functional trypsinogen gene (T6) in the TCRB locus but related to the other 

30 pancreatic trypsinogens. Rowen et al. (1996) stated that the T6 gene is deleted in a common 

insertion-deletion polymorphism; if it is functional, its function is apparently not essential. Some 
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of the trypsinogen genes are expressed in nonpancreatic tissues where their function is unknown. 
Rowen et al. (1996) noted that the intercalation of the trypsinogen genes in the TCRB locus is 
conserved in mouse and chicken, suggesting shared functional or regulatory constraints, as has 
been postulated for genes in the major histocompatibility complex (such as class I, n, and HI 
5 genes) that share similar long-term organizational relationships. 

Rowen et al. (1996) mapped the gene corresponding to the third pancreatic trypsinogen 
cDNA by fluorescence in situ hybridization. They used a cosmid clone containing 3 trypsinogen 
genes. Strong hybridization to chromosome 7 and weaker hybridization to chromosome 9 were 
observed. They isolated and partially sequenced 4 cosmid clones from the chromosome 9 region. 

1 0 They found that the region represents a duplication and translocation of a DNA segment from the 
3-prime end of the TCRB locus that includes at least seven V(beta) elements and a functional 
trypsinogen gene denoted T9. The assignment of the PRSS1 gene to 7q35 is established by the 
demonstration of its sequence within the sequence of the locus' (OMIM- 186930) for the T-cell 
receptor beta-chain (Rowen et al., 1996). It is further supported by the linkage between 

15 microsatellite markers in the 7q35 region and hereditary pancreatitis (OMIM- 167800) and the 
demonstration of mutations in the PRSS1 gene in hereditary pancreatitis. 

Whitcomb et al. (1996) stated that the high degree of DNA sequence homology (more 
than 91%) present among this cluster of five trypsinogen genes identified by Rowen et al. (1996) 
demanded that highly specific sequence analysis strategies be developed for mutational 

20 screening in families with hereditary pancreatitis. This was necessary to ensure that each 

sequencing run contained only the two alleles corresponding to a single gene, thereby permitting 
detection of heterozygotes in this autosomal dominant disorder, and not a dozen or more alleles 
from multiple related trypsinogen-like genes, which would make detection of heterozygotes 
nearly impossible. In a family with hereditary pancreatitis, Whitcomb et al. (1 996) found that 

25 affected individuals had a single G-to-A transition mutation in the third exon of cationic 
trypsinogen (276000.0001). This mutation was predicted to result in an argl05-to-his 
substitution in the trypsin gene (residue number 1 17 in the more common chymotrypsin number 
system). Subsequently, the same mutation was found in a total of five different hereditary 
pancreatitis kindreds (four from the U.S. and one from Italy) containing a total of 20 affected 

30 individuals and six obligate carriers. The mutation was found in none of the obligate unaffected 
members (individuals who married into the family). Subsequent haplotyping revealed that all 
four of the American families displayed the same high risk haplotype over a 4-cM region 
encompassing seven STR markers, confirming the likelihood that these kindreds shared a 
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common ancestor, although no link could be found through eight generations. A fifth family 
from Italy displayed a unique haplotype indicating that the same mutation had occurred oh at 
least 2 occasions. The G-to-A mutation at codon 117 created a novel enzyme recognition site for 
Aflin which provided a facile means to screen for the mutation. As with the obligate unaffected 
5 members of the pancreatitis kindreds, none of 140 controls possessed the G-to-A mutation as 
assayed by the lack of AflTTT digestion of the amplified exonic DNA. 

Failure to thrive, nutritional edema, and hypoproteinemia with normal sweat electrolytes 
were features of 2 affected male infants reported by Townes (1965) and Townes et al. (1967). A 
protein hydrolysate diet was beneficial. A male sib of the first patient reported by Townes (1965) 
10 had died, apparently of the same condition. Morris and Fisher (1967) reported an affected female 
who also had imperforate anus. The clinical picture in enterokinase deficiency (OMIM- 226200) 
is closely similar; however, the defect is not in the synthesis of trypsinogen but in the synthesis 
of the enterokinase which activates proteolytic enzymes produced by the pancreas. Oral 
pancreatin represents a therapeutically successful form of enzyme replacement (Townes, 1972). 

1 5 Since hereditary pancreatitis has been mapped rather precisely to 7q35 and since a defect in the 
trypsinogen gene has been identified in hereditary pancreatitis, the assignment of the trypsinogen 
gene can be refined from 7q32-qter to 7q35. 

Ferec et al. (1999) studied 14 families with hereditary pancreatitis and found mutations in 
the PRSS1 gene in 8 families. In 4 of these families, the mutation (Rl 1 7H; 276000.0001) had 
20 been described by Whitcomb et al. (1996). Three novel mutations were described in 4 other 
families. 

Sahin-Toth et al. (1999) studied the roles of the R117H and N21I (276000.0002) 
mutations in hereditary pancreatitis. They stated that the Rl 17H mutation is believed to cause 
pancreatitis by eliminating an essential autolytic cleavage site in trypsin, thereby rendering the 

25 protease resistant to inactivation through autolysis. Sahin-Toth et al. (1999) demonstrated that 
the Rl 17H mutation also significantly inhibited autocatalytic trypsinogen breakdown under 
Ca(2+)-tree conditions and stabilized the zymogen form of rat trypsin. Taken together with 
findings demonstrating that the N21I mutation stabilized rat trypsinogen against autoactivation 
and consequent autocatalytic degradation, the observations suggested a unifying molecular 

30 pathomechanism for hereditary pancreatitis in which zymogen stabilization plays a central role. 

Uses of the Compositions of the Invention 
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The above defined information for this invention suggests that this Trypsin-like protein 
may function as a member of a 'Trypsin family*'. Therefore, the novel nucleic acids and proteins 
identified here may be useful in potential therapeutic applications implicated in (but not limited 
to) various pathologies and disorders as indicated below. The potential therapeutic applications 
5 for this invention include, but are not limited to: protein therapeutic, small molecule drug target, 
antibody target (therapeutic, diagnostic, drug targetmg/cytotoxic antibody), diagnostic and/or 
prognostic marker, gene therapy (gene delivery/gene ablation), research tools, tissue regeneration 
in vivo and in vitro of all tissues and cell types composing (but not limited to) those defined here. 
The nucleic acids and proteins of the invention are useful in potential therapeutic 

10 applications implicated in failure to thrive, nutritional edema, and hypoproteinemia, trypsinogen 
deficiency disease, chronic and heriditary pancreatitis, enterkinase defieciency, cancer and/or 
related pathologies and disorders and/or other pathologies and disorders. For example, a cDNA 
encoding the Trypsin-like protein may be useful in gene therapy, and the Trypsin-like protein 
may be useful when administered to a subject in need thereof. By way of nonlimiting example, 

1 5 the compositions of the present invention will have efficacy for treatment of patients suffering 
from failure to thrive, nutritional edema, and hypoproteinemia, trypsinogen deficiency disease, 
chronic and heriditary pancreatitis, enterkinase defieciency, cancer. The novel nucleic acid 
encoding Trypsin-like protein, and the Trypsin-like protein of the invention, or fragments 
thereof, may further be useful in diagnostic applications, wherein the presence or amount of the 

20 nucleic acid or the protein are to be assessed. These materials are further useful in the generation 
of antibodies that bind immunospecifically to the novel substances of the invention for use in 
therapeutic or diagnostic methods. 



MOL7 

25 A novel nucleic acid encoding a Kallikrein-like-protein MOL7 was identified by TblasfN 

using CuraGen Corporation's sequence file for MOL7 probes or homologs, and run against the 
Genomic Daily Files made available by GenBank. The nucleic acid was further predicted by the 
program GenScan™, including selection of exons. These were further modified by means of 
similarities using BLAST searches. The sequences were then manually corrected for apparent 

30 inconsistencies, thereby obtaining the sequences encoding the full-length protein. The disclosed 
novel MOL7 nucleic acid of 181 1 nucleotides (also referred to as 30675745.0.499) is shown in 
Table 7 A. An open reading frame begins with an ATG initiation codon at nucleotides 368-370 
and ends with a TAG codon at nucleotides 1553-1 555. A putative untranslated region upstream 
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from the initiation codon and downstream from the termination codon are underlined in Table 
7 A, and the start and stop codons are in bold letters. 



Table 7A. MOL7 Nucleotide Sequence (SEQ ID NO:15) 

ACAAATCCTTCTGTTGAACTCTACTGTGTCAGGCCAGCCTGAGTTCATTTCTCCTTGAGCAGGAACAGTT 
CAT GGACGAAC TCTGAGGACCAT T CTG AGGACAAGAGGCATCCAGT GTCAT GAGT GG AACATGC AGCATT 
TTATGGCTACAGAGTTAAGGCAAGGGTTGAATTCCACGAGTCAAAAAGCAGCCCTTTTCAGAGACCCAAC 
TCTCTGGGGTGCTCAGGGGCTTGGGCTGGATTGAGAAGAAAACTGACAAGAGTAAGCTGCCCTCTCTTCT 
CTGGCCATCTCACAAACCACAGTGCGGGCCAACTGGTCCTGCCTCTTTACCACACAGAACCAAGCACTAG 
GGATAAGACAGCTGCCCA TGGTGTCCGCGGCGGGTCTCTCTGGGGATGGCAAGATGCGAGGGGTGCTCCT 
GGTGCTGCTCGGCCTTCTCTATTCTTCCACCAGTTGTGGCGTCCAGAAAGCTTCCGTTTTCTACGGTCCT 
GACCCCAAGGAGGGCTTGGTCAGCAGCATGGAGTTCCCGTGGGTGGTGTCGCTGCAGGACTCCCAGTACA 
CACACCTGGCTTTCGGCTGCATCCTGAGCGAGTTCTGGGTCCTCAGCATCGCATCCGCCATTCAGAACAG 
GAAGGACATTGTCGTTATAGTGGGTATAAGTAACATGGATCCTAGCAAGATTGCTCACACAGAGTATCCA 
GTCAATACCATCATCATCCATGAGGACTTTGATAACAACTCCATGAGCAACAACATAGCCCTCCTGAAGA 
CAGACACAGCGATGCATTTTGGCAACCTGGTCCAGTCCATCTGCTTCCTCGGCAGAATGCTGCATACACC 
ACCAGTCTTGCAGAACTGCTGGGTGTCAGGATGGAATCCCACATCTGCAACAGGAAATCACATGACGATG 
AGTGTCCTGAGGAAAATCTTCGTGAAAGATCTTGACATGTGTCCCCTATACAAACTCCAGAAGACAGAAT 
GCGGCAGCCACACGAAAGAGGAAACCAAGACTGCCTGCTTGGGGGACCCAGGAAGCCCAATGATGTGCCA 
GCTACAGCAGTTCGATCTGTGGGTTCTGAGAGGAATCCTGAACTTCGGTGGTGAGACGTGCCCTGGCCTG 
TTTCTGTACACCAAGGTGGAAGACTACAGCAAATGGATCACATCCAAGGCTGAGAGGGCCGGCCCTCCCC 
TGTCCTCACTCCACCACTGGGAAAAGTTGATTTCTTTCTCCCACCATGGACCAAATGCCGCCATGACACA 
GAAGACATATTCTGATTCTGAACTGGGCCATGTTGGATCATACTTGCAGGGACAAAGAAGGACCATCACG 
CATTCACGACTAGGAAACAGCTCTAGAGATAGTCTAGATGTTAGGGAGAAGGATGTAAAGGAATCAGGCA 
GGTCTCCTGAGGCGTCTGTACAACCCTTATACTATGACTATTACGGTGGGGAGGTGGGGGAAGGTAGGAT 
TTTTGCAGGTCAGAACAGGTTGTATCAGCCCGAAGAAATCATCTTGGTTTCCTTCGTGCTTGTTTTCTTT 
TGCAGCAGTATCTA GTCCAGGAGCTACCCCACCAAACTGAAGAGTAAACTGAGAATGCTGAGTGCCAGGC 
ATTCACCATGCTGTTTTGATGTCTGTTTTTGATAGTTGCACACTGGGGCTGCCACGGATAAGCCCATGGC 
ATACACTGGGCTGGCTCTCCCTCCTCTATCCCTCTCCCAGGTGTGGGAAGGTCACTTTCACTATGCTTGT 
GAACTAAATGCTGGCTAACAAGTGTCAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 



The MOL7 protein encoded by SEQ ID NO: 1 5 has 395 amino acid residues, and is 
presented using the one-letter code in Table 7B (SEQ ID NO: 1 6). The SignalP, Psort and/or 
Hydropathy profile for MOL7 predict that MOL7 has a signal peptide and is likely to be 
localized at the plasma membrane with a certainty of 0. 9190. The SignalP shows a signal 
10 sequence is coded for in the first 44 amino acids with the most likely cleavage site being between 
amino acids 30 and 3 1 . This is typical of this type of membrane protein. The molecular weight 
of MOL7 is 43S15.7 Daltons. 



Table 7B. Encoded MOL7 protein sequence (SEQ ID NO: 16). 

MVSAAGLSGIX3KI4RG VXJjVLLGIJj YSSTS CGVQKAS VFYGPDP KE GLVS SME F P WWSLQDS QYTHLAFGC I LS 
EFWVLS IASAIQNRKDIWIVGI SNMDPSKI AHTEYPVNTI I IHEDFDNNSMSNNIALLKTDTAMEFGNLVQSI 
CFLGRMLHTP PVI^NCWVSGWNPTSATGNHMTMSVLRKI fvkdldmcpl yklqkte cgs htke etktaclgd pg 
SPMMCQLG£FDLWVLRGILBnPGG 

TQKTYSDSBIjGHVGS YLQGQRRTITHS RLGNS SRDSLDVREKDVKESGRS PEAS VQPLYYDYYGGB VGEGRI PA 
GQNRLYQPEE 1 1 LVS FVLVFFCSSI 
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MOL7 was found to have 290 of 290 amino acid residues (100 %) identical to the 290 
amin o acid residue hypothetical 32.6 kD protein from Homo sapiens (human) 
(ACC:CAB70765). This protein has similarity to kallikrein. 

MOL7 shows significant homologies to human hypothetical 32.6 kD protein (a protein 
5 with similarities to Kallikrein), as described in, but not limited to, the references below. 

Kallikreins are a subgroup of serine proteases and these proteolytic enzymes have diverse 
physiological functions in many tissues. Growing evidence suggests that many kallikreins are 
implicated in carcinogenesis. The human kallikrein gene family is localized on chromosome 
19ql3.3-ql3.4 and currently includes three members: KLK1 or pancreatic/renal kallikrein, 
1 0 KLK2 or human glandular kallikrein and KLK3 or prostate-specific antigen (PSA). The latter 
two genes are almost prostate-specific and they are used for diagnosis and monitoring of prostate 
cancer and more recently, in breast cancer applications (Yousef et aL Anticancer Res 1999 Jul- 
Aug;19(4B):2843-52). These new genes, like the already known kallikreins, may have utility for 
diagnosis, monitoring and therapeutics of various cancers including those of the breast, prostate 
15 and testis. 

Monsees et aL, Immunopharmacology 1999 Dec;45(l-3):107-14, found that elements of 
the kallikrem-kinin system are present in rat seminiferous epithelium. Peptide hormones are 
involved in the paracrine regulation of several physiological processes. The paracrine peptide 
system may play a role in the regulation of Sertoli cell function or in the Sertoli cell-germ cell 

20 crosstalk, and therefore, be involved in mammalian reproduction, especially spermatogenesis. 

Chen et aL t J Biol Chem 1996 Nov l;271(44):27590-4 5 found that the kallikrem-kinin 
system participates in blood pressure regulation. One of the kallikrein-kinin system components, 
kallikrein-binding protein, binds to tissue kallikrein and inhibits its activity in vitro. 

The glandular kallikreins are a distinct group of serine proteases with a molecular weight 

25 of 25,000-40,000 and an ability to release vasoactive peptides from kininogen in vitro, although 
the kininogenase activity of different kallikreins is highly variable. The true physiologic role of 
specific kallikreins is often unrelated to the kininogenase activity. In the mouse a major site of 
kallikrein synthesis is the male submaxillary gland. Glandular kallikreins are also synthesized in 
the pancreas and kidney. The several kallikreins found in this tissue include epidermal growth 

30 factor binding protein (EGF-BP) and the gamma subunit of nerve growth factor (NGFG; 
162040) which are responsible for the processing of EGF (13 1530) and NGF (162030), 
respectively. Although EGF-BP and NGFG exhibit strict substrate specificity, they share 
extensive amino acid sequence homology and immunologic crossreactivity. Mason et al. (1983) 
concluded that the glandular kallikrein gene family comprises 25-30 highly homologous genes 
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that encode specific proteases involved in the processing of biologically active peptides. All are 
closely linked on mouse chromosome 7 (assignment by Chinese hamster-mouse hybrid cell 
studies). Several human kallikrein genes have been isolated. 

Schedlich et al (1 987) described a human glandular preprokallikrein gene, hGK-1, 
5 isolated from a human genomic library. The 5.2-kb gene encodes a prepropeptide of 261 amino 
acids. The mature protein is 237 amino acids long and has 66% homology with the sequence 
predicted for the human kallikr ein synthesized in pancreas, kidney, and salivary gland (KLK1 ; 
147910). Seventy-three percent homology with human prostate-specific antigen (APS; 176820) 
was observed. Expression of the glandular kallikrein gene, like that of the APS gene, seems to be 

10 restricted to the prostate. Riegman et al (1 989) found that the glandular kallikrein gene and that 
for prostate-specific antigen are aligned in a head-to-tail orientation and are separated by about 
12 kb. Southern blot analysis of DNA from a panel of human-hamster hybrid cells showed that 
the genes are situated on chromosome 19. Since the KLK1 gene is also on chromosome 19, these 
3 genes probably represent a cluster. From in situ hybridization studies, Qin et al (1991) 

1 5 concluded that the glandular kallikrein gene and probably other kallikrein genes are located in 
ql3.3 and ql3.4 bands of chromosome 19 and are probably near the border of these two bands. 

Therapeutic applications 

20 The expression pattern, and protein similarity information for MOLT suggest that it may 

function as human Kallikrein-like protein. Therefore, the nucleic acid and protein of the 
invention are useful in potential therapeutic applications implicated, for example but not limited 
to, various cancers including those of the testis, prostate, and breast; mammalian reproduction, 
especially spermatogenesis; blood pressure regulation; and other diseases and disorders. The 

25 homology to antigenic secreted and membrane proteins suggests that antibodies directed against 
the novel genes may be useful in treatment and prevention of various cancers including those of 
the testis, prostate, and breast, mammalian reproduction, especially spermatogenesis; blood 
pressure regulation; and other diseases and disorders. 

Potential therapeutic uses for the inventions) are, for example but not limited to, the 

30 following: (i) Protein therapeutic, (ii) small molecule drug target, (iii) antibody target 

(therapeutic, diagnostic, drug targeting/cytotoxic antibody), (iv) diagnostic and/or prognostic 
marker, (v) gene therapy (gene delivery/gene ablation), (vi) research tools, and (vii) tissue 
regeneration in vitro and in vivo (regeneration for all these tissues and cell types composing these 
tissues and cell types derived from these tissues. 
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The nucleic acids and proteins of the invention are useful in potential therapeutic 
applications implicated in various cancers including those of the testis, prostate, and breast; 
mammalian reproduction, especially spermatogenesis; blood pressure regulation;and other 
diseases and disorders. For example, but not limited to, a cDNA encoding the novel human 

5 plasma membrane protein may be useful in gene therapy, and the novel human plasma 

membrane protein may be useful when administered to a subject in need thereof. By way of 
non-limiting example, the compositions of the present invention will have efficacy for treatment 
of patients suffering from, for example, but not limited to, various cancers including those of the 
testis, prostate, and breast; mammalian reproduction, especially spermatogenesis; blood pressure 

10 regulation; and other diseases and disorders. The novel nucleic acid encoding the novel human 
plasma membrane protein, and the novel human plasma membrane protein of the invention, or 
fragments thereof, may further be useful in diagnostic applications, wherein the presence or 
amount of the nucleic acid or the protein are to be assessed. These materials are further useful in 
the generation of antibodies that bind immunospecifically to the novel substances of the 

1 5 invention for use in therapeutic or diagnostic methods. 

MOL8 

MOL8a 

A novel human Acetyl LDL Receptor-like nucleic acid was identified by TblastN using 
20 CuraGen Corporation's sequence file for MOL probes or homologs and run against the Genomic 
Daily Files made available by GenBank. The nucleic acid was further predicted by the program 
GenScan™, including selection of exons. These were further modified by means of similarities 
using BLAST searches. The sequences were then manually corrected for apparent 
inconsistencies, thereby obtaining the sequences encoding the foil-length protein. The disclosed 
25 novel MOL8a nucleic acid of 980 nucleotides (also referred to as 1 1 800699-0-16) is shown in 
Table 8B. An open reading frame begins with an ATG initiation codon at nucleotides 1 -3 and 
ends with a TGA codon at nucleotides 2803-2805. A putative untranslated region downstream 
from the termination codon are underlined in Table 8 A, and the start and stop codons are in bold 
letters. 

30 

Table 8A. MOL8a Nucleotide Sequence (SEQ ID NO:17) 

ATGGAGGGCGCAGGGCCCCGGGGGGCCGGGCCGGCGCGGCGCCGGGGAGCCGGGGGGCCGCCGTCACCGCTGCTGC 
CGTCGCTGCTGCTGCTGCTGCTGCTCTGGATGCTGCCGGACACCGTGGCGCCTCAGGAA.CTGAACCCTCGCGGCCG 
CAACGTGTGCCGTGCTCCCGGCTCCCAGGTGCCCACGTGCT^ 

ATTGCGGTGTGCGAAGGCARCTCCACGTGCTCAGAGAACGAGGTGTGCGTGAGGCCTGGCGAGTGCCGCTGCCGCC 
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ACGGCTACTTCGGTGCCAACTGCGACACCAGTGAGCGTGGGGTCGGGCCGGTATTGGTCGGTGGGGCGGAATCCTG 
GAGAGATGGGGCGGGGTCQ^AGGTGGGGCGGGGTCGGATCC^ 

GTGCGGGACGCGGGCAGGTTCCGGCTGGCTGGGGGCACCTACTCAAGCACCX^GCCTTCCACCCCCTCCGCTCCT 
CCCCTGCAGAGTGCCCGCGCCAGTTCTC^ 

CGAGGACGTGACAGGCCAGTGTACTTGTCACGCGCGGCGCTGGGGCGCGCGCTGCGAGCATGCGTGCCAGTGCCAG 
CACGGCACGTGCCACCCCCGGAGCGGCGCGTGCCGCTGTGAGTCCGGCTGGTGGGGCGCGCAGrGCGCCAGCGCGT 

gctactgcagcgccacgtcgcgctgcgacccacagaccggcgcctgcctgtgccacg 

c tgcaacaaccagtg cgcctgcaactcgtctccctgcgagcagcagagcg sccgctgtcagtgccgcgagcgtacg 

titcggcgcgcgctgcgatcgctactgccagtgcttccgcggccgctgccacccl'gtggacggcacgtgtgcctgcg 

agccgggctaccgcggcaagtactgtcgcgagccgtgccccgccggcttctacggcttgggctgtcgccgccggtg 

tggccagtgcaagggccagcagccgtgcacggtggccgagggcc^ 

accaagtg c gaccag cc t tgcgccaccgg ttt c t atggcgagggc tg cagccaccgctgtcc gc catgcc gc g ac g 
ggcatgcctgtaaccatgtcaccggcaagtgtacgcgctgcaacgcgggctggatcggcgaccggtgcgagaccaa 
gtgtagcaatggcacttacggcgaggactgcgccttot 
tcggggcgctgcctgtgcmccctggcgtcc^^ 

cggactgtgctcaggcctgcagctgccacgaggacacgtgcgacccggtcactggtgcctgccacctagaaaccaa 

ccagcgcaagggcgtgatgggcgcgggcgcgctgctcgtcctgctcgtctgcctgctgctctcgctgctcggctgc 

tgctgcgcttgccgcggcaaggaccctacgcgccgggagctttcgcttgggaggaagaaggcgccgcaccgactat 

gcgggcgcttrag?cgcatcagcatgaagctgccccggat^ 

ggcccaccacgacctggataacacactcaactgcagcttot 

tcctggtcctctcgggcctccttctcctcgtttgacaccactgatgaaggccctgtgtactgtgtaccccatgagg 
aggc&ccagcggagagccgggaccccgaagtccccact^ 

gaccacgccagcctccgccgaggaggcgatacccctccccgcgtcctccgacagcgagcggtcggcgtccagcgtg 
gaggggcccggaggggctctgtacgcgcgcgtggcccgacgcgaggcccggccggcccgggcccggggcgagattg 
ggggcctgtcgctgtcgccatcgcccgagcgcaggaaaccgccgccacctgaccccgccaccaagcctaaggtgtc 
ctggatccacggcaagcacagcgccgctgcagctggccgtgcgccctcaccaccgccgccaggctccgaggccgcg 
cccagccccagc aagaggaaacggacgccc ag cgacaaat cggcgcat acggtcgaacacgg cagcccccggaccc 

GCGACCCAACGCCGCGCCCCCCGGGGCTGCCCGAGGAGGCGACAGCCCTCGCTGCGCCCTCGCCGCCCAGGGCCCG 
AGCGC^C^CGCGCCCCGGCCTCTTGGAGCCCACGGACGCCGGCGGTCCCCCGK^GAAGCGCGCCGAGGCTGCCTCC 
ATGTTGGCCGCTGACGTGCGCGGCAAGACTCGCAGCCTGGGCCGCGCCGAGGTGGCCCTGGGCGCGCAGGGCCCCA 
GGGAAAAGCCGGCGCCCCCACAGAAAGCCAAGCGCTCCG^ 

CGAAACCCCGGGGCCTGAGAAGGCGGCGACCGACTTGCCCGCGCCTGAGACCCCCCGGAAGAAGACCCCCATCCAG 
AAGCCGCCGCGCAAGAAGAGCCGGAGGCGGCGGGCGAGCTGGGCAGGGCGGGCGCACCCACCCTGTAg - 



The MOL8a protein encoded by SEQ ID NO : 1 6 has 324 amino acid residues, and is 
presented using the one-letter code in Table 8B (SEQ ID NO: 1 8). The SignalP, Psort and/or 
Hydropathy profile for MOL8a predict that MOL8a has a signal peptide and is likely to be 
5 localized at the plasma membrane with a certainty of 0.6000. The SignalP shows a signal 
sequence with a cleavage site between amino acids 43 and 44. This is typical of mis type of 
membrane protein. Therefore it is likely that this novel human plasma membrane protein is 
available at the appropriate sub-cellular localization and hence accessible for the therapeutic uses 
described in this application. 

10 



Table 8B. Encoded MOL8a protein sequence (SEQ ID NO:18). 

MEGAG PRGAG PARRRGAGGP PS PLLPSLLLLLLLWML PDTVAPQELNPRGRNVCRAPGSQVPTCCAGV7RQQGDECG 
I AVCE GN S TC SENEVC VRPGECRCRHGYFGANCDT SE RGVGPVLVGGAE S WRDGAGS KVGRGRI RLRGG S PEVAAG 
VRDAGRFRLAGGTYSSTGAFHPLRS3PAECPRQFWGPDCKELCSCHEHGQCEDVTGQCTCHARRWGARCEHACQCQ 
HGTCHPRSGACRCESG^GAC^^ACYCSATSRCDPQTGACLCHAGWGRSCNNQCACNSSPCEQQSGRCQCRERT 
FGARC DRYCQCFl^GRCHPVDGTC ACE PGYRGKYCRE PCPAGFYGMCRRRCGG^KGQQPCTVAEGRCLTCE PGWNG 
TKCIXJPCATGFYGEGCSHRCPPCRDGHACIJHVTGKCTRCNAGWIGDRCETKCSNGTYGEIX^FVCADCGSGHCDFQ 
SGRCLCS PGVHGPHCNVTC P PGLHGADCAQAC S CHE DTC D PVTGACHLETNQRKGVMGAGALLVLLVCLLLS LLGC 
CCACRGKDPTRRELSLGRKKAPHRLCGRFSRISI^PRIPI^C^ 

SWSSRASFSSFDTTDEGPVYCVPHEEAPAESRDPEVPTVPAEAPAPSPVPLTTPASAEEAIPLPASSDSERSASSV 
EGPGGALYARVARREARPARARGEIGGiSLSPSPERRKPPPPDPATKPKVSWIHGKHSAAAAGRAPSPPPPGSEAA 
PS PSKRKRTPSDKSAHTVEHGS PRTRDPTFRPPGLPEEATALAAPSPPRARARAAPRPLGAHGRRRSPAKRAEAAS 
MLAADVRGKT RSLGRAE VALGAQG PREKPAP PQKAKRS VP PAS PARAPPATET PGPEKAAT DLPAPET PRKKT PIQ 
KPPRKKSREAAGELGRAGAPTL 
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The full amino acid sequence of the protein of the invention was found to have 576/729 
(79%) identical and 596/729 (81%) similarity to a murine nurse cell receptor amino acid 
sequence (PatP Accession No. Y85616). The full amino acid sequence of the protein of the 
5 invention was also found to have 296 of 741 amino acid residues (39 %) identical and 383 of 741 
amino acid residues (51 %) homolog to the 830 amino acid residue acetyl LDL receptor 
precursor from Homo sapiens (human) (ACC:O43701). 

MOL8a is expressed in the following tissues: fetal thymus, mammary gland, fetal 
thymus, pool of ten tissues (adrenal, mammary, prostate, testis, uterus, bone marrow*, 
10 melanoma*, pituitary*, thyroid*, spleen) (*from mRNA rather than from total RNA). 



MOL8b 

A novel nucleic acid was identified by laboratory cloning of cDNA fragments, by in 
silico prediction of the sequence. cDNA fragments covering either the full length of the DNA 

15 sequence, or part of the sequence, or both, were cloned. In silico prediction was based on 
sequences available in Curagen's proprietary sequence databases or in the public human 
sequence databases, and provided either the full length DNA sequence, or some portion thereof. 
These were further modified by means of similarities using BLAST searches. The sequences 
were then manually corrected for apparent inconsistencies, thereby obtaining the sequences 

20 encoding the full-length protein. The disclosed novel MOL8b nucleic acid of 2598 nucleotides 
(also referred to as CG50889-02) is shown in Table 8C. An open reading frame begins with an 
ATG initiation codon at nucleotides 1-3 and ends with a TAG codon at nucleotides 2596-2598. 
A putative untranslated region upstream from the initiation codon and downstream from the 
termination codon are underlined in Table 8D, and the start and stop codons are in bold letters. 

25 



Table 8D. MOL8b Nucleotide Sequence (SEQ ID NO:19) 

ATGGAGGGCGCAGGGCCCCGGGGGGCCGGGCCGGCGCGGCGCCGGGGAGCCGGGGGGCCGCCGTCACCGCTGCTGC 
CGTCGCTGCTGCTGCTGCTGCTGCTCTGGATGCTGCCGGACACCGTGGCGCCTCAGGAACTGAACCCTCGCGGCCG 
CAACGTGTGCCGTGCTCCCGGCTCCCAGGTGCCCACGTGCTGCGCTGGCTGGAGGCAGCAAGGGGACGAGTGTGGG 
ATTGCGGTGTGCGAAGGCAACTCCACGTGCTCAGAGAACGAGGTGTGCGTGAGGCCTGGCGAGrGCCGCTGCCGCC 
ACGGCTACTTCGGTGCCAACrGCGACACa^GTGCCCGCGCCAGTTCTGGGGCCCCGACTGCAAGGAGCTGTGTAG 
CT GCCACCCACACGGGCAGTGCGAGGACGTGACAGGCCAGTGTACTTGTCACGC GCG GCGCTGGGGCGCGCGCT G C 
GAGCATGCGTGCCAGTGCCAGCACGGCACGTGCCACCCGCGGAGCGGCGCGTGCCGCTGTGAGCCCGGCTGGTGGG 
GCGCGCAGTGCGCCAGCGCGTGCTACTGCAGCGCCACGTCGCGCTGCGACCrACAGACCGGCGCCTGCCTGTGCCA 
CGCAGGCTGGTGGGGCCGCAGCTGCAACAACCAGT^ 

TGTCAGTGCCGCGAGCGTACGTTCGGCGCGCGCTGCGATCGCTACTGCCAGTGCTTCCGCGGCCGCTGCCACCCTG 
TGGACGGCACGTGTGCCTGCGAGCCGGGCTACCGCGGCAAGTACTGTCGCGAGCCGTGCCCCGCCGGCTTCTACGG 
CCTGGGCTGTCGCCGCCGGTGTGGCCAGTGCAAGGGCCAGCAGCCGTGCACGGTGGCCGAGGGCCGCTGCTTGACG 
TGCGAGCCCGGCTGGAACGGAACCAAGTGCGACC^GCCTTGCGCCACCGGTTTCTATGGCGAGGGCTGCAGCCACC 
GCTGTCCGCCATGCCGCGACGGGCATGCCTGTAACCATGTCACCGGCAAGTGTACGCGCTGCAACGCGGGCTGGAT 
CGGCGACCGGTGCGAGACCAAGTGTAGCAATGGCACTTACGGCGAGGACTGCGCCTTCGTGTGCX^CGACTGCGGC 
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AGCGGACACTGCGACTTCCAGTCGC-GGCGCTGCCT^ 

GCCCGCCCGGACTCCACGGCGCGGACTGTGCTCAGGCCTGCAGCTGC^CGAGGACACGTGCGAOXXGTCACTGG 

TGCCTGCCACCTAGAAACCAACCAGCGCAA^ 

CTGCTCTCGCTGCTCGGCTGCTGCTGCGOT 

AGAAG GCGC C GCAC CGACT ATGCGGGCGCTT CAGTCGCAT CAGCAT GAAGCTG CCC C GGATC CCGCTC C G GAGGCA 
GAAACTACCCAAAGTCGTAGTGGCCCACCACGACCTGGATAAQACACTCAACT 

GGGCTG<^GCAGCCCTCACCATCCTGGTCCTCTCGGGCCTCCTTCTCCTCGTTTGACACCACTGATGAAGGCCCTG 

TGTACTGTGTACCCCATGAGGAGGCACCAGCGGAGAGCCGGGACCCCGA 

GGCGCCGTCCCCTGTGCCCTTGACCACGCCAG 

GAGCGGT<X3GCGTCCAGCGTGGAGGGGCCCGGAGGGGCTCTGT^^ 

CCCG^GCCCGGGGCGAGATTGGGGGCCTGTCGCTGTCGC 

CGCCACCAAGCCTAAGGTGTCCTGGATCCACGGCAAGCACAGC^ 

CCGCCAGGCTCCGAGGCCGCGCCCAGCCCCAGCAA^ 

AACACGGCAGCCCCCGGACCCGCGACCCAACGCCGCGCCCCC^ 

GCCCTCGO:GCCCAGGGCCCGAG€GCGCGCCGCGCCCCGGCCTCTTGGAGCCCACGGACGCCGGCGGTCCCCCGCG 
AAGCGCGCCGAGGCTGCCTCCATGTTGGCCGCTGACGTGCGCGGCAAGACTCGCAGCCTGGGCCGCGCCGAGGTGG 
CCCTGGGCX3CGCAGGGCCCCAGGGAAAAGCCGGCGCCCCCACAGAAAGCCAAGCGCTCCGTGCCGCCAGCCTCGCC 
CGCCCGCGCGCCCCCAGCGACCGAAACCCCGGGGCCTGA 

C GGAAGAAGACC C C CAT CCAGAAG CC G C CGCGC AAG AAGAG CC GG GAGG CGGCGGGC GAG CT GGGCAGGG CGGG C G 
CACCCACCCTGTAG 



The disclosed nucleic acid sequence has 1 3 1 1 of 2041 bases (64%) identical to a 
gb:GENBANK-ID:D86864|acc:D86864.1 mRNA from Homo sapiens (Homo sapiens mRNA for 
acetyl LDL receptor, complete cds) (E value = 3 .2e- 98 ). 
5 The MOL8b protein encoded by SEQ ID NO: 17 has 865 amino acid residues, and is 

presented using the one-letter code in Table 8E (SEQ ID NO:20). The SignalP, Psort and/or 
Hydropathy profile for MOL8a predict that MOL8a has a signal peptide and is likely to be 
localized at the plasma membrane with a certainty of 0.6000. The SignalP shows a signal 
sequence with a cleavage site between amino acids 43 and 44. This is typical of this type of 
1 0 membrane protein. Therefore it is likely that this novel human plasma membrane protein is 

available at the appropriate sub-cellular localization and hence accessible for the therapeutic uses 
described in this application. 



Table 8E. Encoded MOL8b protein sequence (SEQ ED NO:20). 

MEGAGPRGAGPARRRGAGGPPSPLLPSLLLLLLLWMLPDTVAPQELNPRGRNVCRAPGSQVPTCCAGWRQQGDECG 

IAVCEGNSTCSENEVCVRPGECRCRHGYFGANCDTKCPRQFWGPDCKELCSCHPHGQCEDVTGQCTCHARRWGARC 

EHACQCQHGTCH PRS G ACRCE PGWWGAQC AS ACYCS AT S RCD PQTGACLC HAGWWGRS CNNQCACNS S PCEQQSGR 

CQCRE RT FGARCDRYCQCFRGRCHPVDGTCACEPGYRGKYCRE PC PAGFYGLGCBRRCGQCKGQQPCTVAEGRCLT 

CEPGWNGTKCDQPCATGFYGEGCSHRCPPCRDGHACNHVTGKCTRCNAGWIGDRCETKCSNGTYGEDCAFVCADCG 

SGHCDFQSGRCLCSPGVHGPHCNVTCPPGLHGAIX^^ 

LLSLLGCCCACRGKDPTRRELSLGFJTCAPHRLCG^ 

GLEQPSPSWSSRASFSSFDTTDEGPVYCVPHEEAPAESRDPEVPTVPAEAPAPSPVPLTTPASAEEAIPLPASSDS 
ERSASSVEGPG<aALYARVARREARPAP^ARGEIGGLSLSPSPERRKPPPPDPA^KPKVSWIHGKHSAAAAGRAPSPP 
PPGSEAAPSPSKRKRTPSDKSAHT^HGSPRTRDPTPRPPGXPEEATALAAPSPPRARARAAPRPLGAHGRRRSPA 
KRAEAASMIAADVRGKTRSLGRAEVALGAQGPREKPAPPQKAKRSVPPASPARAPPATETPGPEK^ 
RKKTPIQKPPRKKSREAAGELGRAGAPTL 



The full amino acid sequence of the protein of the invention was found to have 340 of 
15 823 amino acid residues (41%) identical to, and 443 of 823 amino acid residues (53%) similar to, 
the 830 amino acid residue ptnr:SPTREMBL-ACC:O43701 protein from Homo sapiens 
(Human) (ACETYL LDL RECEPTOR PRECURSOR) (E value = 9.0e- 164 ). 
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Homology between MOL8a, 8b and the human acetyl LDL receptor are presented 
graphically in the multiple sequence alignment given in Table 8F (with MOL8a being shown on 
line 1, and MOL8b being shown on line 2) as a ClustalW analysis comparing MOL8 with related 
protein sequences. 



Table 8F. Information for the ClustalW proteins: 

1) M0L8a (SEQ ID NO: 18) 

2) M0L8b (SEQ ID NO: 20) 

3) gi I 4507203|ref |NP_003684.1 I acetyl LDL receptor; SREC=scavenger receptor 
expressed by endothelial cells [Homo sapiens] (SEQ ID NO: 57) 
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Chromosomal information: 

MOL8 maps to chromosome 22ql 1 . This assignment was made using mapping 
information associated with genomic clones, public genes and ESTs sharing sequence identity 
with the disclosed sequence and CuraGen Corporation's Electronic Northern bioinformatic tool. 

Tissue expression 

MOL8 is expressed in at least the following tissues: kidney, senescent fibroblasts, 
lymphocyte, B cell, and germ cell tumors. Expression information was derived from the tissue 
sources of the sequences that were included in the derivation of the sequence of CuraGen Acc. 
No. CG50889-02. 

MOL8 shows significant homologies to human LDL Receptor-like protein, as described 
in, but not limited to, the references below. Hypercholesterolemia is an autosomal dominant 
disorder characterized by elevation of serum cholesterol bound to low density lipoprotein (LDL). 
Mutations in the LDL receptor (LDLR) gene on chromosome 19 cause this disorder. Familial 
hypercholesterolemia is characterized by elevation of serum cholesterol bound to low density 
lipoprotein (LDL) and is, hence, one of the conditions producing the hyperlipoproteinemia II 
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phenotype (see OMIM 144400). Heterozygotes develop tendinous xanthomas, corneal arcus, and 
coronaiy artery disease; the last usually becomes evident in the fourth or fifth decade. 
Homozygotes develop these features at a accelerated rate m addition to planar xanthomas, which 
may be evident at birth in the web between the first two digits. 
5 Hepatitis C virus (HCV), the principal viral cause of chronic hepatitis, is not readily 

replicated in cell culture systems, making it difficult to ascertain information on cell receptors for 
the virus. However, several observations from studies on the role of HCV in mixed 
cryoglobulinemia provided some insight into HCV entry into cells. Evidence indicated that HCV 
and other viruses enter cells through the mediation of LDL receptors: by the demonstration that 

1 0 endocytosis of these viruses correlates with LDL receptor activity, by complete inhibition of 
detectable endocytosis by anti-LDL receptor antibody, by inhibition with anti-apolipoprotein E 
and anti-apolipoprotein B antibodies, by chemical methods abrogating lipoprotein/LDL receptor 
interactions, and by inhibition with the endocytosis inhibitor phenylarsine oxide. Agnello et al. 
(1999) provided confirmatory evidence by the lack of detectable LDL receptor on cells known to 

15 be resistant to infection by one of these viruses, bovine viral diarrheal virus (BVDV). 

Endocytosis via the LDL receptor was shown to be mediated by complexing of the virus to very 
low density lipoprotein (VLDL) or LDL, but not high density lipoprotein (HDL). Studies using 
LDL receptor-deficient cells or a cytolytic BVDV system indicated that the LDL receptor may 
be the main but not exclusive means of cell entry of these viruses. 

20 Therapeutic uses of the composition 

The expression pattern, and protein similarity information for MOL8 that it may function 
as human LDL Receptor-like protein. Therefore, the nucleic acid and protein of the invention are 
useful in potential therapeutic applications implicated, for example but not limited to, metabolic 
disorders, e.g. Hypercholesterolemia, viral diseases, and other diseases and disorders. The 

25 homology to antigenic secreted and membrane proteins suggests that antibodies directed against 
the novel genes may be useful in treatment and prevention of metabolic disorders, e.g. 
Hypercholesterolemia, viral diseases, and other diseases and disorders. 

Potential therapeutic uses for the invention(s) are, for example but not limited to, the 
following: (i) Protein therapeutic, (ii) small molecule drug target, (iii) antibody target 

30 (therapeutic, diagnostic, drug targeting/cytotoxic antibody), (iv) diagnostic and/or prognostic 
marker, (v) gene therapy (gene delivery/gene ablation), (vi) research tools, and (vii) tissue 
regeneration in vitro and in vivo (regeneration for all these tissues and cell types composing these 
tissues and cell types derived from these tissues. 
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The nucleic acids and proteins of the invention are useful in potential therapeutic 
applications implicated in various cancers including those metabolic disorders, e.g. 
Hypercholesterolemia, viral diseases, and other diseases and disorders. For example, but not 
limited to, a cDNA encoding the novel human plasma membrane protein may be useful in gene 
5 therapy, and the novel human plasma membrane protein may be useful when administered to a 
subject in need thereof. By way of non-limiting example, the compositions of the present 
invention will have efficacy for treatment of patients suffering from, for example, but not limited 
to, various cancers including those of the metabolic disorders, e.g. Hypercholesterolemia, viral 
diseases, and other diseases and disorders. The novel nucleic acid encoding the novel human 

10 plasma membrane protein, and the novel human plasma membrane protein of the invention, or 
fragments thereof, may further be useful in diagnostic applications, wherein the presence or 
amount of the nucleic acid or the protein are to be assessed. These materials are further useful in 
the generation of antibodies that bind immunospecifically to the novel substances of the 
invention for use in therapeutic or diagnostic methods. 

1 5 These materials are further useful in the generation of antibodies that bind immuno- 

specifically to the novel MOL8 substances for use in therapeutic or diagnostic methods. These 
antibodies may be generated according to methods known in the art, using prediction from 
hydrophobicity charts, as described in the "Anti-MOLX Antibodies" section below. For 
example the disclosed MOL8 protein has multiple hydrophilic regions, each of which can be 

20 used as an immunogen. In one embodiment, a contemplated MOL8 epitope is from about amino 
acids 1 to 10. In another embodiment, a MOL8 epitope is from about amino acids 50 to 200. In 
further embodiment, a MOL8 epitope contains amino acids 210-400, 475-600, or 625-850. These 
novel proteins can also be used to develop assay system for functional analysis. 

25 MOL9 

MOL9a 

A novel nucleic acid encoding a neurolysin -like protein was identified by TblastN using 
CuraGen Corporation's sequence file for MOL9 probe or homolog, run against the Genomic 
Daily Files made available by GenBank. The nucleic acid was further predicted by the program 
30 GenScan™, including selection of exons. These were further modified by means of similarities 
using BLAST searches. The sequences were then manually corrected for apparent 
inconsistencies, thereby obtaining the sequences encoding the full-length protein. The disclosed 
novel MOL9a nucleic acid of 2355 nucleotides (also referred to as 19506719_BJEXT) is shown 
in Table 9A. An open reading frame begins with an ATG initiation codon at nucleotides 1-3 and 
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ends with a TGA codon at nucleotides 1915-1917. A putative untranslated region upstream from 
the initiation codon and downstream from the termination codon are underlined in Table 9 A, and 
the start and stop codons are in bold letters. 

The nucleic acid sequence has 1307 of 1428 bases (91%) identical to a pig neurolysin 
5 mRNA (GENBANK-ID: AB000170) (Expect = 0.0). 



Table 9A. MOL9a Nucleotide Sequence (SEQ ID NO:21) 

ATGTTGACTTTGGACCAACAGAAATCCCTAATTCTTATTCTTTTTCTGATTCTTTTTAGAGTTGGTGGTTC 
CAGGATT TTACTCAGAAT GACGTTAGGAAGAGAAGTGATGTCTCCTCT TCAGGCAATGTCT TCCTATACT G 
TGGCTGGCAGAAATGT TT TAAGAT GGGATCTTTCACC AGAGCAAATT AAAACAAGAACTGAGGAGCrCAT T 
GT GCAGACCAAACAGGTGTACGAT GCTGTTGGAATGCTCGGTATT GAGGAAGT AACTTACGAGAACTGTCT 
GCAGGCACTGGCAGTGGAAAGGAC CATGCT AGACTTT CCCCAGCATGTAT CCTCT GAC AAAGAAGTACGAG 
CAGCAAGTACAGAAGCAGACAAAAGACTT TCT CGT TT TGATAT T GAGATG AGCATGAGAGGAGATATATTT 
GAGAGAATTGTT CATT TACAGCAGGAAACCTGTGATCTGGGGAAGATAAAACCTGAGGCCAGACGATACTT 
GGAAAAGTCAATTAAAATGGGGAAAAGAAATGGGCTCCATCTTCCTGAACAAGTACAGAATGAAATCAAAT 
CAATGAAGAAAAGAATGAGTGAGCTATGTATTGATTTTAACAAAAACCTCAATGAGGATGATACCTTCCTT 
GTATTTTCXMGXTCTGAACTTGGTGCTCTTCCTGATGAITOCA^ 

CAAGTATAAAATTACCTTAAAATATCCACACTATTTCCCTGTCATGAAGAAATGTTGTATCCCTGAAACCA 
GAAGAAGGATGGAAATGGCTTTTAATACAAGGTGCAAAGAGGAAAACACCATAATTTTGCAGCAGCTACTC 
CC ACTGCGAACCAAGGTG GCCAAACTACT CGGTTATAGCACACAT GCTGACTTCGTCCTTGAAATGAACAC 
TGC^^AAGAGCACAAGCCGCGTAACAGCCTTTCTAGATGATTTAAG 

CAGAACGAG AGTT TAT TT TGAATT TG AAGAAAAAGGAATG C AAAG ACAG GGGT TT TGAAT AT GAT GGGAAA 
ATCAATGCCTGGGATCTATATTACTACATGACTCAGACAGAGGAACTCAAGTATTCCATAGACCAAGAGTT 
CCTCAAGGAATACTTCCCAATTGAGGTGGTCACTGAAGGCTTGCTGAACACCTACCAGGAGTTGTTGGGAC 
TT TCAT TTGAACAAAT GACAGATGCT CAT GTT TGGAACAAGAGTGTTACACTTTATACTGTGAAGGATAAA 
GCTACAGGAGAAGTATTGGGACAGTTCTATTTGGACCTCTATCCAAGGCCTAGGGAAGGAAAATACAATCA 
TGCGGCCTGCTTCGGTCTCCAGCCTGGCTGCCTTCTGCCTGATGGAAGCCGGATGATGGCAGTGGCTGCCC 
TCGTGGTGAACTTCTCACAGCCAGTGGCAGGTCGTCCCTCTCTCCTGAGACACGACGAGGTGAGGACTTAC 
TTTCATGAGTTTGGTC ACSTGATGCATCAGAT TT GTGCACAGACT GATTTTGCACGATTTAGCGGAACAAA 
TGTGGAAACTGACTTTGTAGAGGTGCCATCGCAAATGCTTGAAAATTGGGTGTGGGACGTCGATTCCCTCC 
GAAG AT TGT C AAAACATT AT AAAGAT GGAAGC CC TAT TGC AGACGATC T GCTTGAAAAACT TGTTG CTT CT 
AGGCTGGTCAACACAGGTATGGGTTATG T TATTAGTAATATATAT TTTCTAGATATGTTTTCATTTCAGTG 
CATCCAAGTTGGAATGAAATACAGAAACCTAATCCTGAAACCTGGGGGAT CTCTGGACGGC AT GGACATGC 
TCCACMTTTCTTGAAACGTGAGCCAAACCAAAAAGCGTTCCTAATGAGTAGAGGCCTGCATGCTCCGTGA 
ACTGGGGATCTTTGGTAGCCGTCCATGTCTGGAGGACAAGTCGACATCACCATGTGTTACTGGCCTGGAAA 
CT GAAGGGAGTTTTGCAAGTGAAAAT TT AGAT TT CTATTG ACATC CTTTT GTTTTCT AATTTTAAAAATTA 
TAAAGATGTAAATGGAATTATAAATACTGTGACCTAAGAAAAGACCCACTAGAAAGTAATTGTACTATAAA 
ATTTCATAAAACTGGATTTGATTTCTTTTTATGAAAGTTTCATATGAATGTAACTTGATTTTTTACTATTA 
TAATCTAGATAATATGATATAAGAGGGCTAAGAATTTTTAAflTTGAATCArATATATGATATAATrTGATC 
CTTCTTGTATCTTGAAGTTTTGTACTTGGGATTTCTGGACTGATAAATGAATCATCACATTCTTCTGGTAA 
ATATTTTCTTGG 



The MOL9a protein encoded by SEQ ID NO:21 and has 638 amino acid residues, and is 
presented using the one-letter code in Table 9B (SEQ ID NO:22). The SignalP, Psort and or 
1 0 Hydropathy profile indicate that this sequence has a signal peptide between positions 23 and 24 
and is likely to be localized at endoplasmic reticulum and plasma membrane (Certainty = 
0.8200). 



Table 9B. Encoded MOL9a protein sequence (SEQ ID NO:22) 

MLTLDQQKSLILILFLILFRVGGSRILLRMT^ 

VQTKQVYDAVGMLGIEEVTYENCXQAIAVERmLDFPQHVSSDKEVRAASTEADKRLSRFDIEMSMRGDIF 
ERIVHLG^ETCDLGKIKPEARRYLEKSIKMGKRNGLHLPEQVQNEIKSMKKRMSELCIDFNKNLNEDDTFL 
VFSKAELGALPDDFIDSLEKTDDDKYKITLKYPHYFPVMKKCCIPETRRR 
PLRTKVAKLLGYSTHADFVLEMNTAKSTSRVTAFLDDI^ 

INAW PL YYYMTQTEELKYS I DQEFLKE YFP IE WT EGLLNTYQE LLGL S F EQMTDAHVWNKS VTL YTVKDK 
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ATGEVLGQFYIiDLYPRPREGKYNHAAC^ 

FHEFGHVMHQICAQTDFARFSGTNVETDFVEVPSQ^ 

RLVNT(^GWISNIYFLDMFSFQCIQV(34Kra^ 

The full amino acid sequence of the protein of the invention was found to have 483/564 
(85%) identity and 508/564 (90%) similarity to a rabbit endopeptidase (PatP Accession No. 
R261 14), and have 571 of 632 amino acid residues (90%) identical to, and 592 of 632 residues 
5 (93.6%) similar to, the 704 amino acid residue neurolysin protein from pig (pta:SPTREMBL- 
ACC: Q02038) (E value = Lie- 302 ). 



MOL9a also has homology to the proteins shown in the BLAST alignments in Table 9C. 



Table 9C. BLASTX results for MOL9a 






Smallest 








Sum 






High 


Prob 




Sequences producing High-scoring Segment Pairs: 


Score 


P(N) 


N 


ptnr:SWISSNEW-ACC:Q02038 NEUROLYSIN PRECURSOR (EC 3.4.24... 


. 2685 


l.le-302 


2 


ptnx:SWISSPROT-ACC:P42675 NEUROLYSIN PRECURSOR (EC 3.4.24.. 


. 2697 


2.2e-302 


2 


ptnr:SPTREMBL-ACC:P79433 BNDOPEPTIDASE 24.16 (EC 3.4.-.-).. 


. 2677 


7.6e-302 


2 


ptnr:SWISSPROT-ACC:P4267G NEUROLYSIN PRECURSOR (EC 3.4.24.. 


. 2601 


4.7e-291 


2 


ptnr:SWISSPROT-ACC:P47788 THIMET OLIGOPEPTIDASE (EC 3.4.2.. 


. 1792 


8.4e-197 


2 



Chromosomal information 

1 0 MOL9a maps to the Unigene entry Hs. 22 1 5 1 which maps to chromosome 5 between 

markers D5S427-D5S647 (69.6-74.7 cM). 

Tissue expression 

MOL9a is expressed in at least the following tissues: fetal lung, testis, B-cell, aorta, 
15 brain, colon, foreskin, germ cell, heart, kidney, pancreas, stomach, uterus, whole embryo and 
cancer cell lines MDA-MB-23 1 and MCF-7. 

These materials are further useful in the generation of antibodies that bind immuno- 
specifically to the novel MOL9a substances for use in therapeutic or diagnostic methods. These 
20 antibodies may be generated according to methods known in the art, using prediction from 
hydrophobicity charts, as described in the "Anti-MOLX Antibodies" section below. For 
example the disclosed MOL9a protein has multiple hydrophilic regions, each of which can be 
used as an rmmunogen. In one embodiment, a contemplated MOL9a epitope is from about 
amino acids 50 to 75. In another embodiment, a MOL9a epitope is from about amino acids 100 
.25 to 1 50. In further embodiments, MOL9a epitopes are found in amino acids 175-200, 225-300, 
325-375, 425-450, 500-550, and 600-625. These novel proteins can also be used to develop 
assay system for functional analysis. 
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MOL9b 

The cloned open reading frame, codes for a 687 amino acid long protein with an overall 
95% amino acid identity, to the mature form of the pig neurolysin precursor (SWISSPROT- 
5 ACC:Q02038).Oligonucleotide primers were designed to PCR amplify a DNA segment, 
representing an ORF, coding for the mature form of 1 95067 19_B EXT. The forward primer 
includes an, in frame, BamHI restriction site. The reverse primer contains an, in frame, Xhol 
restriction site. The sequences of the PCR primers are the following: 



10 195067 1 9_B-EXT Mat-Forw: 

GGATCCTCCAGGATTTTACTCAGAATGACGTTAGG (SEQ ID NO:58) 
19506719_B-EXT FL-Rev: 

CTCGAGCGGAGCATGCAGGCCTCTACTCATTAGGAACG (SEQ ID NO:59) 



15 PCR reactions were set up using a total of 5ng cDNA, consisting equal amounts of 

cDNA derived from human fetal brain, testis, skeletal muscle and mammary, template, 1 microM 
of each of the 19506719_B-EXT Mat-Forw and 1 95067 19_B-EXT FL-Rev primers, 5 
micromoles dNTP (Clontech Laboratories, Palo Alto CA) and 1 microliter of 50xAdvantage-HF 
2 polymerase (Clontech Laboratories, Palo Alto CA) in 50 microliter volume. The following 

20 reaction conditions were used: 
a) 96°C 3 minutes 

b) 96°C 30 seconds denaturation 

c) 60°C 30 seconds annealing 

d) 72°C 3 minute extension. 

25 Repeat steps b-d 35 times 

e) 72oC 10 minutes seconds final extension 

A single, 2.1 kb large, PCR product, was isolated from agarose gel and ligated to pCR2.1 
vector (Invitrogen, Carlsbad, CA). The cloned insert was sequenced, using vector specific, Ml 3 
30 Forward(-40) and Ml 3 Reverse primers as well as the gene specific primers: 

GGACCATGCTAGACTTTCC, 19506719J3-EXT SI; (SEQ ID NO:60) 
GGAAAGTCTAGCATGGTCC, 1 95067 19_B-EXT S2; (SEQ ID NO:61) 
GGCTGAACTTGGTGCTCTTCC, 1 9506719_B-EXT S3; (SEQ ID NO:62) 
GGAAGAGCACCAAGTTCAGCC, 19506719_B-EXT S4; (SEQ IDNO:63) 
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GGCTTGCTGAACACCTACC, 1 95067 19_B-EXT S7; (SEQ ID NO:64) 
GGTAGGTGTTCAGCAAGCC, 1 95067 19_B-EXTS8; (SEQ IDNO:65) 
GCACAGACTGATTTTGCACG, 1 95067 19_B-EXT S9; (SEQ ID NO:66) and 
CGTGCAAAATCAGTCTGTGC, 1 95067 19_B-EXT S10 (SEQ ID NO:67). 

• 5 

The disclosed novel MOL9b nucleic acid of 2061 nucleotides (also referred to as 
MOL9b) is shown in Table 9D. It is thought that MOL9b is an internal fragment of an open 
reading frame. Therefore its 5' and V termini may be extended 



Table 9D. MOL9b Nucleotide Sequence (SEQ ID NO:23) 

TCCAGGAT TT TACTCAGAATG ACGTT AGGAAGAGAAG TGAT G TCTCCT CTTCAGGCAATGTCT T CCT ATAC 
TGTGGCTGGCAGAAATGTTTTAAG AT GGGAT CT TTCACCAGAGCAAAT T AAAACAAGAACT GAG GAGCTCA 
TTGTGCAGACCAAACAGGTGT ACGAT GCTGTTGGAAT GCT CG GTATTGAGGAAGTAACTTACGAGAACTGT 
CTGCAGGCACTGGCAGATGTAGAAGrAAAGTATATAGTGGAAAGGACCATGCTAGACTTTCCCCAGCATGT 
ATCCT CT GACAAAGAAGT ACG AGC AG C AAGTACAGAAGCAGACAAAAGACTTT CTCGTTT T GAT ATTGAGA 
TGAGC ATGAGAGGAGAT ATAT TTGAG AGAATTGTT CATTT ACAGGAAACCTGT GATCTGG G G AAGAT AAAA 
CC T GAGGC CAGAC GAT ACT TG G AAAAGTCAAT T AAAAT GGGGAAAAGAAATGG GCT CCAT C T T C CT GAAC A 
AGTACAGAATGAAATC AAATCAAT GAAGAAAAGAATGAGTGAGCr ATGTATTGAT TTTAACAAAAACCT CA 
ATGAGGATGATACCTTCCTTGTAT TT TCCAAGGCTGAACTTGGTGCTCTTCCT GATGATT TCAT TGACAGT 
TTAGAAAAGACAGATGATGACAAGTATAAAATTACCTT AAAATAT CCACACTATT TCCCT GTCATGAAGAA 
ATGTTGTATCCCTGAAACCAGAAGAAGGATGGAAATGGCTTTTAATACAAGGTGCAAAGAGGAAAACACCA 
TAATTTTGCAGGAGCTACTCCCACTGCGAACCAAGGTGGC(3AAACTACTCGGTTATAGCACACA 
TTCGTCCTTGAAATGAACACTGCAAAGAGCACAAGCCGCGTAACAGCCTTTCTAGATGATTTAAGCC 
GTT AAAACCCTTGGGT GAAGC AGAACGAGAGTTTATT TTGAATTTGAAGAAAAAGGAATGC AAAGACAGGG 
GT TT TGAATATGAT GGGAAAATCAATGCCTGGGATCT ATATT ACT ACAT GAC T CAGACAGAGGAACTCAAG 
TATTC C^T AG ACCAAGAGTTCCTCAAGGAATACTTCC CAATTGAGGTGGTCACTGAAGGCT TGCTGAAC AC 
CTACCAGGAGTTGTTGGGACT TTCATTTGAAC AAATGACAGATGCTCAT GTTTGGAACAAGAGT GTT AC AC 
TTTATACTGTGAAGGATAAAGCTAC^GGAGAAGTATTGGGACAGTTCTATTTGGACCTCTATCCAAGGGAA 
GGAAAATACAATCATGCGGCCTGCTTCGGTCTCCAGCCTGGCTGCCTTCTGCCTGATGGAAGCCGGATGAT 
GGCAGTGGCTGCCCTCGTGGTGAACTTCTCACAGCCAGTGGCAGGTCGTCCCTCTCTCCTGAGACACGACG 
AGGTGAGGACTTACT TTCATG AGT TTGGT CACGT GAT GCATCAGATTTGTGCACAG ACT GATTTTGCACGA 
TTTAGCGGAACAAATGTGGAAACTGACTTTGTAGAGGTGCC^TCGCAAATGCTTGAAAATTGGGTGTGGGA 
CGTCGATT CCCTCCGAAGATTGTCAAAAC ATT ATAAAG AT GGAAGCCCTATTGCAG ACGAT CT GCTT G AAA 
AACTTGTTGCTTCTAGGCTGGTCAACACAGGTCTTCTGACCCTGCGCCAGATTGTTTTGAGCAAAGTTGAT 
CAGTCTCTTCATAC(mCACATCGCTGGATGCTGC^GTGAATATGCCAAATACTGCTCAGAAATATTAGG 
AG T T G CAGCT ACT CCAGG CACAAAT ATG C CAGC T ACC T T T GGACAT T T GGCAGGGGGAT AC GAT GGCCAAT 
ATTATGGATATCTTTGGAGTGAAGTATTTTCCATGGATATGTTTTACAGCTGTTTTAAAAAAGAAGGGATA 
AT GAATCCGGAGGTTGGAATGAAATACAGAAACCTAATCCTGAAACCT GGGG GATCTCTGGACGGC ATGGA 
CATGCTCCACAATTTCTTGAAACGTGAGCCAAACCAAAAAGCGTTCCTAATGAGTAGAGGCCTGCATGCTC 

cg ; 

10 

The MOL9b protein encoded by SEQ ID NO:21 has 687 amino acid residues, and is 
presented using the one-letter code in Table 9E (SEQ ID NO:24). 



Table 9E. Encoded MOL9b protein sequence (SEQ ID NO:24) 

srillrmtlgrevms plqamssytvagrnvlrwdls peqtktrteelivqtkqvymvgmlgieevtyenc 
ixjaladvevkyivertmldfpqhvssdkevraastead& 

pearryleksikkgkrnglhlpeqvqneiksmkkrmselcidfnknlneddtflvfskaelgal 

lektdddkykitlkyphyfpvmkkccifetrrrmemtntrckeentiilqqllplrtkvakllgysthad 

fvlemntakst s rvt afx ddls qklkplgeaere filnlkkkeckdrgfeydgkinaw dl yyymtqteelk 

ysidqeflkeyfpievwegllntyqelix;lsfeqmtdahvwnksvtlytvkdkatgevlgqftld 

gkynhaacfgi^pgcllpdgsrmmavaalvvnfsqpvagrpsllrhdevrtyfhefg^^hqicaqtdfar 

fsgtnvetdfvevpsc^enwvwdvdslrrlskhykdgs 

qslhtntsldaaseyakycssilgvaatpgtnmpatfghlaggydgqyygylwsevfsmdmfyscfkkegi 

MNPEVGMKYI^LILKPGGSLDGMDMLHNFLKREPNQKAFLMSRGLHAP 
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MOL9c 

In the present invention, the target sequence identified previously, Accession Number 
19506719JB_EXT, was subjected to the exon linking process to confirm the sequence. PCR 
5 primers were designed by starting at the most upstream sequence available, for the forward 

primer, and at the most downstream sequence available for the reverse primer. In each case, the 
sequence was examined, walking inward from the respective termini toward the coding 
sequence, until a suitable sequence that is either unique or highly selective was encountered, or, 
in the case of the reverse primer, until the stop codon was reached. Such primers were designed 

10 based on in silico predictions for the full length cDNA, part (one or more exons) of the DNA or 
protein sequence of the target sequence, or by translated homology of the predicted exons to 
closely related human sequences sequences from other species. These primers were then 
employed in PCR amplification based on the following pool of human cDNAs: adrenal gland, 
bone marrow, brain - amygdala, brain - cerebellum, brain - hippocampus, brain - substantia 

15 nigra, brain - thalamus, brain -whole, fetal brain, fetal kidney, fetal liver, fetal lung, heart, 

kidney, lymphoma - Raji, mammary gland, pancreas, pituitary gland, placenta, prostate, salivary 
gland, skeletal muscle, small intestine, spinal cord, spleen, stomach, testis, thyroid, trachea, 
uterus. Usually the resulting amplicons were gel purified, cloned and sequenced to high 
redundancy. The resulting sequences from all clones were assembled with themselves, with 

20 other fragments in CuraGen Corporation's database and with public ESTs. Fragments and ESTs 
were included as components for an assembly when the extent of their identity with another 
component of the assembly was at least 95% over 50 bp. In addition, sequence traces were 
evaluated manually and edited for corrections if appropriate. These procedures provide the 
sequence reported below, which is designated Accession Number CG56222-01 

25 Hie disclosed novel MOL9c nucleic acid of 21 67 nucleotides (also referred to as 

CG56222-0 1) is shown in Table 9F An open reading frame begins with an ATG initiation codon 
at nucleotides 16-18 and ends witii a TGA codon at nucleotides 2128-2130. Aputative 
untranslated region upstream from the initiation codon and downstream from the termination 
codon are underlined in Table 9F, and the start and stop codons are in bold letters. 

30 In a search of sequence databases, it was found, for example, that the nucleic acid 

sequence has 2000 of 2167 bases (92%) identical to a gb:GENBANK- 
ID:AB000170|acc:AB000170.1 mRNA from Sus scrofa (Porcine mRNA for endopeptidase 
24.16, complete cds) (Expect = 0.0). 
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Table 9F. MOL9c Nucleotide Sequence (SEQ ID NO:25) 

CCTCTCAGCXSCTCCCA TGATCGCCCGGTCrc^ 

GATTT^ACTX^GAATGACGTTAGGAAGAGAAGTGAT^ 

CTGGCAGAAATGTTOTAAGATGGGATCTTTCACCAGAGCAAATTO 

CAGACCAAACAGGTGTACGATGCTGTTGG^TGCTCGGTATXGAGGAAGTAACTTACGAGAACTGTCTGCA 
GGCACTGGCAGATGTAGAAGTAAAGTATATAGTGGAAAGGACCATGCTAGACTTTCCCCAGCATGTATCCT 
CTGACJWV3AAGTACG AGCAGCAAGT ACAG AAGCAGACAAAAGACTTT CT CGT TTTGAT ATTGAGATGAG C 
AT GAGAGG AGATAT AT TTGAGAGAATTGT TCATTTACAGG AAACCTGT GATCT GGGGAAGATAAAACCTGA 
GGCCAGACGATACTTGGAAAAGTCAATTAAAATGGGGAAAAGAAATGGGCTCCATOT 
AGAATGAAATCAAATCAATGAAGAAAAGAATGAGTGAGCT ATGTATTGAT TTT AACAAAAACCT CAATGAG 
GATGATACCTTCCTTGTATTTTCCAAGGCTGAACTTGGTGCTCTTCCTGATGATTTCATTGACAGTTTAGA 
AAAGACAGATGATGACAAGTATAAAATTACCTTAAAATATCC^CAC^^ 

GTATCCCTGAAACCAGAAGAAGGATGGAAATGGCTTTTAATACAAGGTGCAAAGAGGAAAACACCATAATT 

TTGCAGCAGCTACTCCCACTGCGAACCMGGTGGCCAAACTACTCGGTTATAGCACACATGCTGACTTCGT 

CCTTGAAATGAAC ACT GC AAAGAGCACAAGCCGCGTAACAGCCTT TCTAGATGATTTAAGCCAGAAGTTAA 

AACCC TTGGGTGAAGC AGAACGAG AGTTT AT TTTG AATTT GAAGAAAAAGGAATGCAAAGAC^ TTT 

GAATATGATGGGAAAATCAATGCCTGGGATCTATATTACTACATGACTCAGACAGAGGAACTCAAGTATTC 

CATAGACCAAGAGTTCCTCAAGGAATACTTCCCAATTGAGGTGGTCACTGAAGGCTTGCTGA^ 

AGGAGTTGTTGGGACTTTCATTTGAACAAATGACAGATGCTCATGTTTGGAACAAGAGTGTTACACTTTAT 

ACTGTGAAGGATAAAGCTACAGGAGAAGTATTGGGACAGTTCTATTTGGACCrCTATCCAAGGGAAGGAAA 

ATACAATCATGCGGCCTGCTTCGGTCTCC^GCCTGGCTGCCTTCTGCCTGATGGAAGCCGGATGATGGCAG 

TGGCTGCCCTCGTGGTGAACTTCTCACAGCCAGTGGCAGGTCGTCCCTCTCTCCTGAGACACGACGAGGTG 

AGGACTTACTTTCATGAGTTTGGTCACGTGATGCATCAGATTTGTGCACAGACTGATTTTGCACGATTTAG 

CGGAACAAATGTGGAAACTGACTTTGTAGAGGTGCCATCGCAAATGCTTGAAAATTGGGTGTGGGACGTCG 

ATTCCCTCCGAAGATTGTCAAAAJ^TTATAAAGATGGAAGCCCTATTGCAGACGATCTGCTTGAAAAACTT 

GTTGCTTCTAGGCTGGTCAAOU^^TCTTCTGACCCTGCGCC^GATTGTTTTGAGCAAAGTTGATCAGTC 

TC TTCAT ACCAAC ACATCGCTGGATGCTGCAAGTGAATAT GCCAAATACTGCTCAGAAATATTAGGAGT TG 

CAGCT ACTCCAGGCACAAAT AT GCCAGCT ACCT T T GGACATT TGGCAGGGGGAT AC GAT GG CCAAT ATT AT 

GGATATCTTTGGAGTGAAGTATTTTCCATGGATATGTTTTACAGCTGTTTTAAAAAAGAAGGGATAATGAA 

TCCGGAGGTTGGAATGAAATACAGAAACCTAATCCTGAAACCTGGGGGATCTCTGGACGGCATGGACATGC 

TCCACAATTTCTT GAAACGTGAGCCAAACCAAAAAGC GTTCCTAATGAGTAGAGGCCTGCATGC TCCGTGA 

ACTGGGGATCTTTGGTAGCCGTCCATGTCTGGAGGAC . 



The MOL9c protein encoded by SEQ ID NO:25 has 703 amino acid residues, and is 
presented using the one-letter code in Table 9G (SEQ ID NO:26). The SignalP, Psort and/or 
Hydropathy profile for MOL9c predict that MOL9c has a signal peptide and is likely to be 
5 localized at the cytoplasm with a certainty of 0.9200. The SignalP predicts a cleavage site at the 
sequence between amino acids 17 and 18. 



Table 9G. Encoded MOL9c protein sequence (SEQ ID NO:26) 

MIARCLLAVRSLRRVGGSRI LLRMTLGREVMS PLQAMS S YTVAGRNVLRWDLS PEQIKTRT EELI VQTKQV 
YDAVGMLGIEEVTYENCLQALADVEVKYIVERTML^ 

FERI^LQETCDLGKIKPEARRYIiEKSIKMGKRNGLHLPEQVQNEIKSMKKRMSELCIDFNKNLNEDDT^ 
VFSKAELGALPDDFIDSLEKTDDDKYKITLKYPHYFPVMKKCCIPETRRRMEMAFNTRCKEENTIIIiQQIi 
PLRTKVAKLLGYSTHADFVLEMNTAKSTSRVTAFLDDLSQI^ 
INAWDLYYYiyrTQTEELKYSIDO^FLKEYFPIEVVTEGLLNTYQELLGLSFEQMT 

ATGEVLGQFYLDLYPREGKYNHAACFGLQPGCLLPDGSRl^VAALVWFSQPVAGRPSLLRHDEVRTYFH 
EFGHVMHQICAQTDFARFSGTNVETDFVEVPSQMLENWVWDTO^ 

WTGLLTLRQIVLSBCVDQSLHTNTSLDAASEYAKYCSEILGVAAT PGTNMPAT FGHLAGGYDGQYYGYLWS 
E V FSMDMFYS C FKKEG IMN PE VGMKYRNLI LK PGG SL DGMDMLHN FLKRE PNQKAFLM S RGLHAP 



The full amino acid sequence of MOL9c was found to have 657 of 704 amino acid 

10 residues (93%) identical to, and 687 of 704 amino acid residues (97%) similar to, the 704 amino 

acid residue ptnr:SWISSPROT-ACC:P42675 protein from Oryctolagus cuniculus (Rabbit) 

(NEUROLYSIN PRECURSOR (EC 3.4.24.16) (NEUROTENSIN ENDOPEPTEDASE) 
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(MITOCHONDRIAL OLIGOPEPTIDASE M) (MICROSOMAL ENDOPEPTEDASE) (MEP) 
(E value = 0.0) 

MOL9c is expressed in at least the following tissues: Artery, Brain, Bronchus, Cartilage, 
Cervix, Colon, Coronary Artery, Dennis, Epidermis, Foreskin, Heart, Kidney, Liver, Ovary, 
Pancreas, Pituitary Gland, Placenta, Prostate, Salivary Glands, Synovium/Synovial membrane, 
Thalamus, Umbilical Vein, Uterus. This information was derived by determining the tissue 
sources of the sequences that were included in the invention including but not limited to 
SeqCalling sources, Public EST sources, Literature sources, and/or RACE sources. 

Possible SNPs found forMOL9c are listed in Table 9H. 



Table 9H: SNPs 


Consensus 
Position 


Depth 


Base 
Change 


399 


99 


A> G 


858 


51 


OA 


863 


50 


T> A 


1242 


48 


T>C 


1810 


141 


G> A 


; 1824 


143 


T>C 


1892 


144 


T>C 



These materials are further useful in the generation of antibodies that bind immuno- 
specifically to the novel MOL9c substances for use in therapeutic or diagnostic methods. These 
antibodies may be generated according to methods known in the art, using prediction from 

1 5 hydrophobicity charts, as described in the "Anti-MOLX Antibodies" section below. For 

example the disclosed MOL9c protein has multiple hydrophilic regions, each of which can be 
used as an immunogen. In one embodiment, a contemplated MOL9c epitope is from about 
amino acids 25 to 75. In another embodiment, a MOL9c epitope is from about amino acids 100 
to 200. In further embodiments, MOL9a epitopes are found in amino acids 250-400, 450-550, 

20 and 650-700. These novel proteins can also be used to develop assay system for functional 
analysis. 

Homology between the MOL9 isoforms and other homologous proteins is presented 
graphically in the multiple sequence alignment given in Table 91 (with MOL9a being shown on 
line 1 , MOL9b on line 2, and MOL9c on line 3) as a ClustalW analysis comparing MOL9 with 
25 related protein sequences. 
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Table 91. Information for the ClustalW proteins: 

1) MOL9a (SEQ ID NO: 22) 

2) MOL9b (SEQ ID NO: 24) 

3) MOL9C (SEQ ID NO: 26) 

A) SWISSNEW-ACC:Q02038 NEUROLYSIN PRECURSOR (SEQ ID NO: 68) 

3) SWISSPROT-ACC:P42675 NEUROLYSIN PRECURSOR (SEQ ID NO: 69) 

5) S?TREMBL-ACC:P79433 END0PEP7IDASE 24.16 (SEQ ID NO:70) 

6) SWISSPROT-ACC:P42676 NEUROLYSIN PRECURSOR (SEQ ID NO:71) 

7) ACC:P47788 THIMET OLIGOPEPTIDASE (SEQ ID NO:72) 



M2L9a Prot 
NOL9b Prot 
M3I*9c Prot 
Q02038 
P42675 
P79433 
P42676 
P47788 



10 
.1 • 



20 



30 



40 50 

• I I ..--l-.-.l .---1 

— MLTLD QQKSLILILF Llf 



MIARCLLAV R! 

MIVECLSAA Rl 

MIARCFSAV ft 

MVYPEGHLAR ELGATFSSSA PLGGHPFPFV WDCLSCKQGD V7SQARPKTNA ERRSGj 

MITLCLSTL gl " 

KPPAA< 
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80 



90 
.1 



100 



110 
..I 



MOL9a Prot — 
MOL9b Prot - 
MOL9c Prot — 
Q02038 
P42675 
P79433 
P42676 
P47788 





130 



140 



150 



160 
..I 



170 
.1 



180 




MOL9a Prot 
MDL9b Prot 
MDlSc Prot 
Q02038 
P42675 
P79433 
P42676 
P47788 



MOL9a Prot 
MDL9b Prot 
MOL9c Prot 
Q02038 
P42675 
P79433 




.1 . 



280 
..I 



SKAELSALPD 
SKAELGALPD 
SKAELGALPD 
SKAELGALPD 
SKAELGALPD 
SKAELGALPD 
SKAELGALPD 



310 


PHYFPVKKKC 




CIFE 


PKYFPVMKKC 




C1PE 


PKYFPVKKKC 




CIPE 


PHYFPVMKKC 




CIPE 


PKYFPVMKKC 




CIPE 


PHYFPVMKKC 




CIPE 



320 
..I - 



330 



340 
..I 




KE^AfijTRCF EDOTIILQQL 

msmafJtrck eeStiilqql 1 

memafKtrce eeKtiilqql 

MSMAFiOTRCK EDjTIILQgL 

MEMAfStrCF. EDTirJjlLQQL 



msmafStrck 




70 



WO 01/81578 



PCTVUS01/13578 




. I . 



390 
..I - 



LSQKLKPLG 
LSQKLKPLG 
LSQKLKPLG 
LSQKLKPLG 
LSQKLKPLG 
LSQKLKPLG 
LSQKLKPLG 



AEREFILgL 

a.erefilSl 

AEREFIlSl 
AEREFIL'SjL 
AEREFILSL 

aerefilSJl 
aerefzlsl 





430 
..I . 



440 



450 



460 



470 



480 



MDL9a Prot 
MDL9b Prot 
MDL9a Prot 
Q02038 
P42675 
P79433 
P42676 
P47788 



MOL9a Prot 
MDL9b Prot 
M0L9c Prot 
Q02038 
P42675 
P79433 
P42676 
P47788 




• I . 



490 
..I 



LYTVKDKATG 
LYTVKDKATG 
LYTVKDKATG 
LYTVKDKATG 
LYTVKDKATG 
LYTVKDKATG 
LYTVKDKATG 




530 



540 



.1. 




PGCLLPDGSR 
PGCLLPDGSR 
PGCLLPDGSR 
PGCLLPDGSR 
PGCLLPDGSR 
PGCLLPDGSR 
PGCLLPDGSR 



590 
..I 



RFSGTgVETD 
RFSGTgVETD 
RFSGTgVETD 
RFSGTgVETD 
RFSGTgVETD 
RFSGTGVETD 

rfsgtHvetd 



610 
.1 



620 
..I . 



630 
..I • 



640 

..I • 



.1- 



MOL9a Prot 
M0L9b Prot 
MOL9c Prot 
Q02038 
P42675 
P79433 
P42676 
P47788 



M0L9a Prot 
MDL9b Prot 
MDL9c Prot 
Q02038 
P42675 
P79433 
P42676 
P47788 




LTLRQIVLSK 
LTLRQIVLSK 
LTLRQIVLSK 
LTLRQIVLSK 




650 



UmhliUWfcW HaaBflEH mfi 593 

r 245 
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P47788 








gi|7305353| 




OtfFFSKXPL 




gi 139278091 




U4LCQRAAWPGHAQNC 





Endopeptidase 24.16 or mitochondrial oligopeptidase, abbreviated here as EP 24.16 
(MOP), is a thiol- and metal-dependent oligopeptidase that is found in multiple intracellular 
compartments in mammalian cells. From an analysis of the corresponding gene, we found that 
5 the distribution of the enzyme to appropriate subcellular locations is achieved by the use of 

alternative sites for the initiation of transcription. The pig EP 24. 16 (MOP) gene spans over 100 
kilobases and is organized into 16 exons. The core protein sequence is encoded by exons 5-16 
which match perfectly with exons 2-13 of the gene for endopeptidase 24.15, another member of 
the thimet oligopeptidase family. These two sets of 1 1 exons share the same splice sites, 

10 suggesting a common ancestor. Multiple species of mRNA for EP 24.16 (MOP) were detected 
by the 5-rapid amplification of cDNA ends and they were shown to have been generated from a 
single gene by alternative choices of sites for the initiation of transcription and splicing. Two 
types of transcript were prepared, corresponding to transcription from distal and proximal sites. 
Their expression in vitro in COS-1 cells indicated that they encoded two isoforms (long and 

15 short) which differed only at their amino termini: the long form contained a cleavable 

mitochondrial targeting sequence and was directed to mitochondria; the short form, lacking such 
a signal sequence, remained in the cytosol. The complex structure of the EP 24.16 (MOP) gene 
thus allows, by alternative promoter usage, a fine transcriptional regulation of coordinate 
expression, in the different subcellular compartments, of the two isoforms arising from a single 

20 gene. PMID: 9182559, UI: 97326108 We have isolated a metallopeptidase from rat liver. The 
peptidase is primarily located in the mitochondrial intermembrane space, where it interacts non- 
covalently with the inner membrane. The enzyme hydrolyzes oligopeptides, the largest substrate 
molecule found being dynorphin A 1-1 7; it has no action on proteins, and does not interact with 
alpha 2-macroglobulin, and can therefore be classified as an oligopeptidase. We term the enzyme 

25 oligopeptidase M. Oligopeptidase M acts similarly to thimet oligopeptidase (EC 3.4.24.15) on 
bradykinin and several other peptides, but hydrolyzes neurotensin exclusively at the -Pro+Tyr- 
bond (the symbol + is used to indicate a scissile peptide bond) rather than the -Arg+Arg- bond. 
The enzyme is inhibited by chelating agents and some thiol-blocking compounds, but differs 
from thimet oligopeptidase in not being activated by thiol compounds. The peptidase is inhibited 

30 by Pro-He, unlike thimet oligopeptidase, and the two enzymes are separable in chromatography 

on hydroxyapatite. The N-terminal amino acid sequence of rat mitochondrial oligopeptidase M 

contains 19 out of 20 residues identical with a segment of rabbit microsomal endopeptidase and 

17 matching the corresponding segment of pig-soluble angiotensin II-binding protein. Moreover, 
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the rat protein is recognized by a monoclonal antibody against rabbit soluble angiotensin II- 
binding protein, all of which is consistent with these proteins being species variants of a single 
protein that is a bomologue of thimet oligopeptidase. The biochemical properties of the 
mitochondrial oligopeptidase leave us in no doubt that it is neurolysin (EC 3.4.24.16), for which 
5 no sequence has previously been reported, and which has not been thought to be mitochondrial. 
PMID: 7836437, UI: 95138171 We have isolated by immunological screening of a lambda 
ZAPII cDNA library constructed from rat brain mRNAs a cDNA clone encoding endopeptidase 
3.4.24.16. The longest open reading frame encodes a 704-amino acid protein with a theoretical 
molecular mass of 80,202 daltons and bears the consensus sequence of the zinc metalloprotease 

10 family. The sequence exhibits a 60.2% homology with those of another zinc metallopeptidase, 
endopeptidase 3.4.24.15. Northern blot analysis reveals two mRNA species of about 3 and 5 
kilobases in rat brain, ileum, kidney, and testis. We have transiently transfected COS-7 cells with 
pcDNA3 containing the cloned cDNA and established the overexpression of a 70-75-kDa 
imfnunoreactive protein. This protein hydrolyzes QFS, a quenched fluorimetric substrate of 

15 endopeptidase 3.4.24.16, and cleaves neurotensin at a single peptide bond, leading to the 

formation of neurotensin (1-10) and neurotensin (11-13). QFS and neurotensin hydrolysis are 
potently inhibited by the selective endopeptidase 3.4.24.16 dipeptide blocker Pro-lie and by 
dithiothreitol, while the enzymatic activity remains unaffected by phosphoramidon and captopril, 
the specific inhibitors of endopeptidase 3.4.24. 1 1 and angiotensin-converting enzyme, 

20 respectively. Altogether, these physicochemical, biochemical, and immunological properties 
unambiguously identify endopeptidase 3.4.24.16 as the protein encoded by the isolated cDNA 
clone. PMID: 7592986, UI: 96070836 A human genomic clone encompassing exons 1-3 of the 
neurotensin/neuromedin N gene was identified using a canine neurotensin complementary DNA 
probe. Sequence comparisons revealed that the 120-arnino acid portion of the precursor sequence 

25 encoded by exons 1-3 is 89% identical to previously determined cow and dog sequences and that 
the proximal 250 bp of 5' flanking sequences are strikingly conserved between rat and human. 
The 5' flanking sequence contains cis-regulatory sites required for the induction of 
neurotensin/neuromedin N gene expression in PCI 2 cells, including API sites and two cyclic 
adenosine-5'-monophosphate response elements. Oligonucleotide probes based on the human 

30 sequence were used to examine the distribution of neurotensin/neuromedin N messenger RNA in 
the ventral mesencephalon of schizophrenics and age- and sex-matched controls. 
Neurotensin/neuromedin N messenger RNA was observed in ventral mesencephalic cells some 
of which also contained melanin pigment or tyrosine hydroxylase messenger RNA. Neurons 
expressing neurotensin/neuromedin N messenger RNA were observed in the ventral 
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mesencephalon of both schizophrenic and non-schizophrenic humans. PMID: 1436492, UI: 
93063858 Neurotensin is a small neuropeptide of 13 amino acids that may function as a 
neurotransmitter or neuromodulator in the central nervous system. In the CNS, neurotensin is 
localized to the catecholamme-containing neurons. A catecholamine-producing cell line can also 
5 produce NT. Lithium salts, widely used in the treatment of manic-depressive patients, 

dramatically potentiate NT gene expression in this cell line. Gerhard et al. (1989) used a canine 
cDNA as a probe on a somatic cell hybrid panel to determine that the human gene is located on 
chromosome 12. The tridecapeptide neurotensin (162650) is widely distributed in various 
regions of the brain and in peripheral tissues. In the brain, neurotensin acts as a neuromodulator, 

10 in particular of dopamine transmission in the nigro striatal and mesocorticolimbic systems, 
suggesting its possible implication in dopamine-associated behavioral neurodegenerative and 
neuTopsychiatric disorders. Its various effects are mediated by specific membrane receptors. Vita 
et al. (1993) isolated a cDNA encoding the human neurotensin receptor and showed that it 
predicts a 418-amino acid protein that shares 84% homology with the rat protein. Le et al. (1997) 

15 also cloned the human neurotensin receptor (NTR) cDNA and its genomic DNA. The gene is 
encoded by 4 exons spanning more than 10 kb. The authors identified a highly polymorphic 
tetranucleotide repeat approximately 3 kb from the gene. Southern blot analysis revealed that the 
NTR gene is present in the human genome as a single-copy gene. Le et al. (1997) stated that the 
neurotensin receptor has 7 transmembrane spanning regions and high homology to other 

20 receptors that couple to G proteins 

Neurolysin is expressed ubiquitously in the rat brain (Massarelli et al. Brain Res 1999 
Dec 18; 851(1-2): 261-5; Dauch et al. J Neurochem 1992 Nov; 59(5): 1862-7). It has been 
suggested that this enzyme plays a role in the regulation of neurologically active peptides 
(Vincent et ah Br J Pharmacol 1997 Jun; 121(4): 705-10) and activity differs depending on the 

25 cellular source of this enzyme whether it is expressed in primary cultured neurons and astrocytes 
(Vincent et al J Neurosci 1996 Aug 15; 16(16): 5049-59). This might play a role in nociception 
and signal transduction in the brain as well as central nervous system. Related endopeptidases 
have been shown to play a role in processing angiotensin and important regulator of blood 
pressure. 

30 Uses of the Compositions of the Invention 

The expression pattern, map location and protein similarity information for MOL9 

suggest that it may function as neurolysin family. Therefore, the nucleic acids and proteins of 

the invention are useful in potential therapeutic applications implicated, for example but not 

limited to, in various pathologies /disorders as described below and/or other 

35 pathologies/disorders. Potential therapeutic uses for the inventions) are, for example but not 
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limited to, the following: (i) Protein therapeutic, (ii) small molecule drug target, (iii) antibody 
target (therapeutic, diagnostic, drug targeting/cytotoxic antibody), (iv) diagnostic and/or 
prognostic marker, (v) gene therapy (gene delivery/gene ablation), (vi) research tools, and (vii) 
tissue regeneration in vitro and in vivo (regeneration for all these tissues and cell types 
5 composing these tissues and cell types derived from these tissues).These may also function in 
extracellular matrix remodeling in tissues described above. 

The nucleic acids and proteins of the invention are useful in potential therapeutic 
applications implicated in various diseases and disorders described below and/or other 
pathologies and disorders. For example, but not limited to, a cDNA encoding the neurolysin - 

1 0 like protein may be useful in gene therapy, and the neurolysin -like protein may be useful when 
administered to a subject in need thereof. By way of nonlimiting example, the compositions of 
the present invention will have efficacy for treatment of patients suffering from Cancer, Trauma, 
Viral/bacterial/parasitic infections, Cardiomyopathy, Atherosclerosis, Hypertension, Congenital 
heart defects, Aortic stenosis, Atrial septal defect (ASD), Atrioventricular (A-V) canal defect, 

15 Ductus arteriosus, Pulmonary stenosis, Subaortic stenosis, Ventricular septal defect (VSD), valve 
diseases, Tuberous sclerosis, Scleroderma, Obesity, Transplantation, Atherosclerosis, 
Aneurysms, Hypertension, Fibromuscular dysplasia, Stroke, Scleroderma, Fertility, Diabetes, 
Von Hippel-Lindau (VHL) syndrome , Pancreatitis, Hirschsprung's disease , Crohn's Disease, 
Appendicitis, Alzheimer's disease, Stroke, Hypercalceimia, Parkinson's disease, Huntington's 

20 disease, Cerebral palsy, Epilepsy, Lesch-Nyhan syndrome, Multiple sclerosis, Ataxia- 
telangiectasia, Leukodystrophies, Behavioral disorders, Addiction, Anxiety, Pain, Systemic 
lupus erythematosus , Autoimmune disease, Asthma, Emphysema, Scleroderma, Autoimmune 
disease, Renal artery stenosis, Interstitial nephritis, Glomerulonephritis, Polycystic kidney 
disease, Renal tubular acidosis, IgA nephropathy, Hypercalceimia, Lesch-Nyhan syndrome. The 

25 novel nucleic acid encoding the neurolysin-like protein, and the neurolysin -like protein of the 
invention, or fragments thereof, may further be useful in diagnostic applications, wherein the 
presence or amount of the nucleic acid or the protein are to be assessed. These materials are 
further useful in the generation of antibodies that bind immunospecifically to the novel 
substances of the invention for use in therapeutic or diagnostic methods 

30 

MOL10 

MOLlOa 

A novel nucleic acid encoding a protein bearing sequence similarity to Cyclic- 
Nucleotide-Gated Olfactory Channel -like protein was identified by TblastN using CuraGen 
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Corporation's sequence file for MOL10 probe or homolog, run against the Genomic Daily Files 
made available by GenBank. The nucleic acid was further predicted by the program GenScan™, 
including selection of exons. These were further modified by means of similarities using 
BLAST searches. The sequences were then manually corrected for apparent inconsistencies, 
5 thereby obtaining the sequences encoding the full-length protein. The disclosed novel MOL1 0a 
nucleic acid of 1 835 nucleotides (also referred to as GM98960647_A) is shown in Table 10A. 
An open reading frame begins with an ATG initiation codon at nucleotides 54-56 and ends with 
a TGA codon at nucleotides 1788-1790. A putative untranslated region upstream from the 
initiation codon and downstream from the termination codon are underlined in Table 10A, and 
1 0 the start and stop codons are in bold letters. 

The nucleic acid sequence has 1 536 of 1733 bases (88%) identical to a Ratius norvegicus 
Cyclic-Nucleotide-Gated Olfactory Channel ocnc2 mRNA (GENBANK-ID: U12623) (Expect = 
5.2e 108 ). 



Table 10A. MOLlOa Nucleotide Sequence (SEQ ID NO:27) 

TACAGGCAGAGAGGGTGTGGACATCTCACACCCCAGCACCAGACCACAGAACCATGAGCCAGGACRCCAAA 
GTGAAGACAACAGAGTCCAGTCCCCCAGCCC(^TCCAAGGCCAGG^GGAAGTTGCTGCCTGTCCTGGACCC 
ATCTGGGGAT TACTACTACTGGTGGCTGAACACAATGGT C TT COCAGT CATGTATAACCTCATCATCCTCG 
TGTGCAGGGCCTGCTTCCCCGACTTGCAGCACGGTTATCTGGTGGCCTGGTTGGTGCTGGACTACACGAGT 
GACCT GCTAT ACCTACTAGACATGGTGGT GCGCTTCC ACACAGGTGGATT CTTGGAACAGGGCAT CCTGGT 
GGTGGACAAGGGTAGGATCTCGAGTCGCTACGTTCGCACCTGGAGTTTCTTCTTGGACCTGGCTTCCCTGA 
TGCCCACAGATGTGGTCTACGTGCGGCTGGGCCCGCACACACCCACCCTGAGGCTGAA(XGCTTTCTCCGC 
GCGCCCCGCCTCTTCGAGGCCTTCGACCGCACAGAGACCCGCACAGCTTACCCAAATGCCTTTCGCATTGC 
CAAGCTGATGCTTTACATTTTT GT CGTCATCCAT TGG AACAGCTG CCT AT ACTT TGCCCTATCCCGGTACC 
TGGGCTTCGGGCGTGACGCATGGGTGTACCCGGACCCCGCGCAGCCTGGCTTTGAGCGCCTGCGGCGCCAG 
TACCTCTATAGCTTTTACTTCTCCACGCTGATACTGACTACAGTGGGCGATACACCGCCGCCAGCCAGGGA 
AGAAGAGTACCTCTTCATGGTGGGCGACTTCCTGCTGGCCGTCATGGGTTTCGCCACCATCATGGGTAGCA 
TGAGCTCTGTCATCTACAACATGAACACTGCAGATGCG^ 

TACAT GAAGCTGCAGCACGT CAACCGCAAGCTGGAGCGGCGAGTTATT GACTGGTATCAGC ACCTGCAGAT 
CAACAAGAAGATGACCAACGAGGTAGCC^TCTTACAGCACTTGCCTGAGCGGCTGCGGGCAGAAGTGGCTG 
TGTCTGTGCACCTGTCC^CTCTGAGCCGGGTGCAGATCTTTCAGAACTGTGAGGCCAGCCTGCTGGAGGAG 
CTGGT GCTGAAGCTGC AGCCC CAGACCTACTCAC CAGGTGAATATGTATGCCGCAAAGGAG ACATTGGCCA 
AGAGATGTACATCATCCGAGAGGGTCAACTGGCCGTGGTGGCAGATGATGGTATCACACAGTATGCTGTGC 
T CGGT GCAGGGCTCTACT TTGGGG AGATC AGCAT CAT CAACATCAAAGGT GGGAACATGT CTGGGAACCGC 
CGCACAGCC AACATCAAGAGCCTAGGTTATT CAGACCT AT TCTGCCTGAG CAAGGAGGACCTGCGGGAGGT 
GCTGAGCGAGTATCCACAAGCACAGACCATCATGGAGGAGAAAGGACGTGAGATCCTGCTGAAAATGAACA 
AGTTGGACGTGAATGCTGAGGCAGCTGAGATCGCCCT GCAGGAGGCCACAGAGTCCCGGCT ACGAGGC CTA 
GACCAGCAGCTGX3ATGATCTACAGACCAAGTTTGCTCGCCTCCTGGCTGAGCTGGAGTCCAGCGCACTTAA 
GATTGCTTACCGCATTGAACGGCT GGAGTGGCAGACT CGAGAGTGGCC AATGCCCG AGGACCTGGCTGAGG 
C T GAT G AC GAGGG T GAG C C TG AGGAGGGAACT T C CAAAGAT G AAGAGGGC AGG GC C AGCCAGG AGG GAC CC 
CCAGGTCCAGAGTGACCCCATCCCCATCCCCAGGATTCCCACCTCCTAGTGAATCCAGAG 

15 

The MOLlOa protein encoded by SEQ ID NO:27 has 638 amino acid residues, and is 
presented using the one-letter code in Table 10B (SEQ ID NO:28). PSORT analysis predicts the 
protein of the invention to be localized in the plasma membrane with a certainty of 0. 6000. 
Using the SIGNALP analysis, it is predicted that the protein of the invention has a signal peptide 
20 with most likely cleavage site between positions 57 and 58. 

76 



WO 01/81578 



PCT/US01/13578 



Table 10B. Encoded MOLlOa protein sequence (SEQ ID NO:28) 

MSQiyTKVKTTESSPPAPSKARRKLLPVLDPSGDYYY^ 

VLDYT S DLL YLLDMWRFHT GGFXEQG IL WDKGRI S S RYVRTWS FFL DLAS LMPT DWYVRLG PH T PTLR 
LNRFLRAPRLFEAFDRTETRTAYPNAFRI AKLML YI FWI HWNSCLYFALSRYLGFGRDAWVYPDPAQPGF 
ERLRRQYLYSFYFSTLILTTVGDTPPPAJIEEEYLFMVGDBIjIiAVMGFATIMGSMSSVIYNMNTADA 
HALVKKYMKLQHVNRKLE RRV I DWYQHLQ IN KKMTN E VAI LQHL PERLRAE VAVS VHLSTL SRVQI FQNCE 
ASLLEELVLKLQPQTYSPGEYVCRKGDIGQEMYI I REGQLAWADDG I TQ YAVLGAGLY FGEISIINI KGG 
NMSGNRRTANIKSIA3YS DLFCLS KEDLRE VLSE YPQAQTIfrffi 

S RLRGL DQQLDDLQTKFARLLAELESSAIiKIAYRI ERL3WQTREW PMPE DLAEADDEGE PE EGTS KDE EGR 
A5QEGPPGPE 

The full amino acid sequence of the protein of the invention was found to have 1068 of 
1649 amino acid residues (64%) identical to, and 1068 of 1649 residues (64%) positive with, the 
575 amino acid residue Cyclic-Nucleotide-Gated Olfactory Channel ocnc2 subunit protein from 
5 Rattus norvegicus (ptnr: SPTREMBL- ACC : Q64359) (E value = 5.5c- 54 ), and .292 of 556 amino 
acid residues (52%) identical to, and 404 of 556 residues (72%) positive with, the 694 amino 
acid residue Cone Photoreceptor cGMP-Gated Channel Alpha Subunit Homo sapiens (Human) 
(pmr: TREMBLNEW -ACC: AAC17440) (E value - 5.8e- 157 ) 

MOLlOb 

10 In the present invention, the target sequence identified previously, MOLlOa Accession 

Number GM98960647_A (also known as CG54557-01), was subjected to the exon linking 
process to confirm the sequence. PCR primers were designed by starting at the most upstream 
sequence available, for the forward primer, and at the most downstream sequence available for 
the reverse primer. In each case, the sequence was examined, walking inward from the 

15 respective termini toward the coding sequence, until a suitable sequence that is either unique or 
highly selective was encountered, or, in the case of the reverse primer, until the stop codon was 
reached. Such primers were designed based on in silico predictions for the full length cDNA, 
part (one or more exons) of the DNA or protein sequence of the target sequence, or by translated 
homology of the predicted exons to closely related human sequences sequences from other 

20 species. These primers were then employed in PCR amplification based on the following pool of 
human cDNAs: adrenal gland, bone marrow, brain - amygdala, brain - cerebellum, brain - 
hippocampus, brain - substantia nigra, brain - thalamus, brain -whole, fetal brain, fetal kidney, 
fetal liver, fetal lung, heart, kidney, lymphoma - Raji, mammary gland, pancreas, pituitary gland, 
placenta, prostate, salivary gland, skeletal muscle, small intestine, spinal cord, spleen, stomach, 

25 testis, thyroid, trachea, uterus. Usually the resulting amplicons were gel purified, cloned and 

sequenced to high redundancy. The resulting sequences from all clones were assembled with 

themselves, with other fragments in CuraGen Corporation's database and with public ESTs. 

Fragments and ESTs were included as components for an assembly when the extent of their 

identity with another component of the assembly was at least 95% over 50 bp. In addition, 
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sequence traces were evaluated manually and edited for corrections if appropriate. These 
procedures provide die sequence reported below, which is designated MOLlOb (Accession 
Number CG54557-02). This differs from the previously identified sequence [GM98960647_A 
(also known as CG54557-01)] at aminoacid position 1 59 T->A and has deletions at positions 22 
5 R,93Gand426G. 

The disclosed novel MOLlOb nucleic acid of 2551 nucleotides (also referred to as 
CG54557-02) is shown in Table 9D. An open reading frame begins with an ATG initiation 
codon at nucleotides 779-781 and ends with a TGA codon at nucleotides 2504-2506. A putative 
untranslated region upstream from the initiation codon and downstream from the termination 
10 codon are underlined in Table 10A, and the start and stop codons are in bold letters. 

The nucleic acid sequence has 1 1625 of 1 857 bases (87%) identical to a gb:GENBANK- 
ID:RNU12425|acc:U12425.1 mRNA from Rattus norvegicus (Rattus norvegicus olfactory cyclic 
nucleotide-gated channel mRNA, complete cds) (Expect = 4.0e" 316 ). 



Table 10D. MOLlOb Nucleotide Sequence (SEQ ID NO:29) 



GTTTTTGTTGTTTTGATATAGGAGATATTGAAGCAGGTTCACAAAAAGAGAAAAGTTGAj^AGATTGGGGAC 
CATAAAACACATGGAATGGTTGGTAGGATCAGGCACTAGAAGTCACAAGAAGGATATGAGGACAAAAGCAC 
CATAGGATGGCCCCTATCACACTACCTATGAGAATGGTGTGATGGGGGAAGGCGTATGTGGAGGTAGATAA 
GGGTAGGAAGTAGGTTACAAAAATAGAGCTCACTTCTCATGTGAGAGGCATCTCTTTGTCCCTGGAGAATA 
GTTTAGCACCTGACATAGATAAGCCATTCAGTAATAGTTGTTAAATAAATAAATAGTGAGGCCCAAATAGA 
ATTTGCAAAGATAAAACAGAGTGTTTGATCCTACACTAAAACTGAGGTCTTCTGACCCAGAGGACACCTAT 
GTAGCTCAGTTGCTGTGGAAAGAGGGGAGGAGGAAAACAGAGACAAGACTCAGGCTTCCCTCTGAGGCATG 
CACCCCCACCTTCTCCAGGGAT CTCATTAGAGGTGT TTAGCT GGGCAGGT GTAAGCCCAGGCCCTGGGAGA 
CAGGGCAGAGTGCTAGAGCTAGACTGTCTCCACCCCTTCAGTAGCGCTAGCTCTGGTTGTGTTGCTAAGAG 
CCCCAAAGACAAAGAAGTCACAGCAGAAGCCCAACAGCAGCCTCCTTC^ 

ACTCCAGAAGTCCCCTACAGGCAGAGAGGGTGTGGACATCTCACACCCCAGCACCAGACC^CAGAACCA TG 
AGCCAGGACACCAAAGTGAAGACAACAGAGTCCAGTCCCCCAGCCCCATCCAAGGCCAGGAAGTTGCTGCC 
TGTCCT GGACCCATCTGGGGATTACTACT ACTGGTGGCTGAACACAAT GGTCTTCCCAGTCATGTAT AACC 
TCATCATCCTCGTGTGCAGAGCCTGCTTCCCCGACTTGCAGCACGGTTATCTGGTGGCCTGGTTGGTGCTG 
GACTACACGAGTGACCTGCTAT ACCTACT AGACATGGTGGTGCGCT TCCACACAGGATTC? TGGAACAGGG 
CATCCTGGTGGTGGACAAGGGTAGGATCTCGAGTCGCTACGTTCGCACCTGGAGTTTCTTCTTGGACCTGG 
CTTCCCTGATGCCCACAGATGTGGTCTACGTGCGGCTGGGCCCGCACACACCCACCCTGAGGCTGAACCGC 
TTTCTCCGCGCGCCCCGCCTCTTCGAGGCCTTCGACCGCGCAGAGACCCGCACAGCTTACCCAAATGCCTT 
TCGCATTGCC7\AGCTGATGCTTTACATTTTTGTCGTCATCCATTGGAACAGCTGCCTATACTTTGCCCTAT 
CCCGGTACCTGGGCTTCGGGCGTGACGCATGGGTGTACCCGGACCCCGCGCAGCCTGGCTTTGAGCGCCTG 
CGGCGCCAGTACCTCTATAGCTTTTACTTCTCCACGCTGATACTGACTACAGTGGGCGATACACCGCCGCC 
AGCCAGGGAAGAAGAGTACCTCTTCATGGTGGGCGACTTCCTGCTGGCCGTCATGGGTTTCGCCACCATCA 
TGGGTAGCATGAGCTCTGTCATCTACAACATGAACACTGCAGATGCGGCTTTCTACCCAGATCATGCACTG 
GTG AAGAAGTACATGAAG CTGCAGCACGTCAACCGCAAGCTGGAGCGGCGAGT T ATTGACT GGTAT CAGCA 
CCTGCAGATCAACAAGAAGATGACCAACGAGGTAGCCATCTTACAGCACTTGGCTGAGCGGCTGCGGGCAG 
AAGTGGCTGTGTCTGTGCACCTGTCCACTCTGAGCCGGGTGCAGATCTTTCAG7^CTGTGAGGCCAGCCTG 
CTGGAGGAGCTGGTGCTGAAGCTGCAGCCCCAGACCTACTCACCAGGTGAATATGTATGCCGCAAAGGAGA 
CATTGGCCAAGAGATGTACATCATCCGAGAGGGTCAACTGGCCGTGGTGGCAGATGATGGTATCACACAGT 
ATGCTGTGCTCGGTGCAGGGCTCTACTTTGGGGAGATCAGCATCATCAACATCAAAGGGAACATGTCTGGG 
AACCGCCGCACAGCCAACAT C AAGAGCCTAGGTTAT TCAGACCTATTCTGCCTGAGCAAGGAGGACCTGCG 
GGAGGT GCTGAGCGAGTATCC ACAAGCACAGACCATCAT GGAGGAGAAAGGACGTGAGATCCTG CT GAAAA 
TGAACAAGTTGGACGTGAAT GCTGAGGCAGCTGAGATCGCCCTGCAGGAGGCCACAGAGTC CCGGCT ACGA 
GGCCTAGACCAGCAGCTGGATGATCTACAGACCAAGTTTGCTCGCCTCCTGGCTGAGCTGGAGTCCAGCGC 
ACTTAAGATTGCTTACCGCATTGAACGGCTGGAGTGGCAGACTCGAGAGTGGCCAATGCCCGAGGACCTGG 
CTGAGGCTGATGACGAGGGTGAGCCTGAGGAGGGAACTTCCAAAGATGAAGAGGGCAGGGCCAGCCAGGAG 
GGACCCCCAG^CCAGAGTGACCCCATCCCCATCCCCAGGATTCCCACCTCCTAGTGAATCCAGAG 

15 ^ ~ ~ " ~ ~ 
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The MOL9a protein encoded by SEQ ID NO:29 has 575 amino acid residues, and is 
presented using the one-letter code in Table 10B (SEQ ID NO:30). PSORT analysis predicts the 
protein of the invention to be localized in the plasma membrane with a certainty of 0. 6000. 
Using the SIGNALP analysis, it is predicted that the protein of the invention has a signal peptide 
5 with most likely cleavage site between positions 56 and 57 (CRA-CF) 



Table 10E. Encoded MOLlOb protein sequence (SEQ ID NO:30) 

MSQDTKVKTTESS PPAPSKARKIJiPVLDPSGDYYYWWLNirMVFPVMYNLI ILVCRACFPDLQHGYLVAWLV 
LDYTSDIJjYLLDMWRFHTGFL^ 

RFLRAPRLFEAFDRAETRT AYPNAFR IAKLMLYI FW I HWNS CLYFALS RYLG FGRDAWVY PD PAQPGFER 

LRRQYLYSFYFSTLILTTVGDTPPPAREEEYLFMVGDFL^ 

LVKKYMKLQHVNRKLERRVIDWYQHLQINKKMTNEVAILQHLPERLRAE 

LLEELVLKLQPQTYS PGE YVCElKGDIGQEb4YI IREGQLAW2U3DGITQYAVLGAGLYFGEI S I INIKGNMS 
GNRRTANIKSI^YSDLFCLSKEDLREVLSEYPQAC^ 

RGLDQQLDDLG/TKFARlJiAELESSALKIAYRIERLEWQTREWPMPEDIiAEADDEGEPEEGTSKDEEGRASQ 
EGPPGPE 

The full amino acid sequence of the protein of the invention was found to have 536 of 
575 amino acid residues (93%) identical to, and 552 of 575 amino acid residues (96%) similar to, 
10 the 575 amino acid residue ptnr:SWISSPROT-ACC:Q64359 protein from Rattus norvegicus 
(Rat) (CY CLIC-NUCLEOTTDE-GATED OLFACTORY CHANNEL OCNC2 SUBUMT) (E 
value = 4.2e- 287 ) 

Chromosomal information: 

The Cyclic-nucleotide gated olfactory channel ocnc2 disclosed in this invention maps to 
1 5 chromosome 1 1 . This information was assigned using OMIM, the electronic northern 
bioinformatic tool implemented by CuraGen Corporation, public ESTs, public literature 
references and/or genomic clone homologies. This was executed to derive the chromosomal 
mapping of the SeqCalling assemblies, Genomic clones, literature references and/or EST 
sequences that were included in the invention. 

20 Tissue expression 

The Cyclic-nucleotide gated olfactory channel ocnc2 disclosed in this invention is 
expressed in at least the following tissues: Adrenal gland, bone marrow, brain - amygdala, brain 
- cerebellum, brain - hippocampus, brain - substantia nigra, brain - thalamus, brain -whole, fetal 
brain, fetal kidney, fetal liver, fetal lung, heart, kidney, lymphoma - Raji, mammary gland, 
25 pancreas, pituitary gland, placenta, prostate, salivary gland, skeletal muscle, small intestine, 
spinal cord, spleen, stomach, testis, thyroid, trachea, uterus. . This information was derived by 
determining the tissue sources of the sequences that were included in the invention including but 
not limited to SeqCalling sources, Public EST sources, and/or RACE sources. 
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In addition, the sequence is predicted to be expressed in the following tissues because of 
the expression pattern of (GENBANK-ID: gb:GENBANK-E):RNU12425|acc:U12425J) a 
closely related Rattus norvegicus olfactory cyclic nucleotide-gated channel mRNA, complete cds 
homolog in species Rattus norvegicus : olfactory neuroepithelium. 

These materials are further useful in the generation of antibodies that bind immuno- 
specifically to the novel MOLlOb substances for use in therapeutic or diagnostic methods. 
These antibodies may be generated according to methods known in the art, using prediction from 
hydrophobicity charts, as described in the "Anti-MOLX Antibodies" section below. For 
example the disclosed MOLlOb protein has multiple hydrophilic regions, each of which can be 
used as an immunogen. In one embodiment, a contemplated MOLlOb epitope is from about 
amino acids 25 to 75. In another embodiment, a MOLlOb epitope is from about amino acids 1 to 
30. In further embodiments, MOLlOb epitopes are found in amino acids 150-250, 275-350, 375- 
400. and 425-560. These novel proteins can also be used to develop assay system for functional 



Homology between the MOL10 isoforms and other homologous proteins is presented 
graphically in the multiple sequence alignment given in Table 91 (with MOLlOa being shown on 
line 1 and MOLlOb on line 2) as a ClustalW analysis comparing MOL10 with related protein 
sequences. 



20 



25 



Table 10C. Information for the ClustalW proteins: 

1) MOLlOa (SEQ ID N0:28) 

2) MOLlOb (SEQ ID NO: 30) 

3) S35691 cyclic nucleotide-gated channel protein - rabbit (SEQ ID NO: 73) 

4) Q64359 Cyclic-Nucleotide-Gated Olfactory Channel ocnc2 subunit protein from 
Rattus norvegicus (SEQ ID NO: 74) 

5) AAC17440 Cone Photoreceptor cGMP-Gated Channel Alpha Subunit Homo sapiens 
(SEQ ID N0:75) 



10 



MOLlOa Pro 
MOLlOb Pro 
S35691 
Q64359 
AAC17440 



MOLlOa Pro 
MOLlOb Pro 
S35691 
Q64359 
AAC17440 



MOLlOa Pro 
MOLlOb Pro 
835691 
Q64359 
AAC17440 
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30 



40 
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50 



60 
.1 



MSSWRSCARA PLSGSAWRRS AATRRSRRCL KTKRKRWSSG KGTPMQSTQC STRRRAQTPC 



MAKIHTQYSH P-S RTH LKVKTSDRDL NRAENGLSRA KSS SEET SSVLQPGIAM 

70 80 90 10C 110 
1 1 I \ I ....1 



1 
1 

60 
1 
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120 
•1 



ESTGHTWRNT EKSNGVKSSP ANNHNNHVPA TIKANGKDES RTRSR-PQSA ADDDTSSELQ 



ETRG LA DSGQGSFTGQ GIAR1SRLIF LLRRWAARHV EJ 

130 140 150 160 



. I . 



.1 



.1. 



.1 



150 
..I . 



DSF PDRFRGAELK 
170 180 



5 
5 

119 
5 

108 



.1 



KVKT7ES 
KVKTTES 



RLAEMpAPQQ RRGGFRRIVR LVGVIRQWAN RNFREEEARP DS 



-AN VGSQEBADRG 
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Cyclic nucleotide-gated (CNG) channels play central roles in visual and olfactory signal 
transduction. In the retina, rod photoreceptors express the subunits CNCalphal and CNCbetala. 
In cone photoreceptors, only CNCaipha2 expression has been demonstrated so far. Rat olfactory 
5 sensory neurons (OSNs) express two homologous subunits, here designated CNCaipha3 and 
CNCalpha4. This paper describes the characterization of CNCbetalb, a third subunit expressed 
in OSNs and establishes it as a component of the native channel CNCbetalb is an alternate 
splice form of the rod photoreceptor CNCbetala subunit. Analysis of mRNA and protein 
expression together suggest co-expression of all three subunits in sensory cilia of OSNs. From 
10 single-channel analyses of native rat olfactory channels and of channels expressed 

heterologously from all possible combinations of the CNCalpha3, -alpha4, and -betalb subunits, 
we conclude that the native CNG channel in OSNs is composed of all three subunits. Thus, CNG 
channels in both rod photoreceptors and olfactory sensory neurons result from coassembly of 
specific alpha subunits with various forms of an alternatively spliced beta subunit 
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Phototransduction is mediated by an enzymatic cascade that ultimately leads to the 
hydrolysis of cGMP. Hie photoreceptor cells, rods and cones, integrate and respond to cGMP 
hydrolysis via a cGMP-gated cation channel in the plasma membrane of the outer segment. 
Kaupp et al. (1989) cloned this channel from bovine retina. Dhallan et al. (1991) used the bovine 
5 sequence to isolate cDNA and genomic DNA encompassing the entire protein coding region of 
the human homolog. Assignment to chromosome 4 was achieved by study of somatic cell 
hybrids. Pittler et al. (1992) determined the primary structures of the human and mouse retinal 
rod cGMP-gated cation channel by analysis of cDNA clones and amplified DNA. The open 
reading frames predicted polypeptides of 690 and 683 residues, respectively, exhibiting 88% 

10 sequence similarity. Significant sequence similarity (59%) of the visual cGMP-gated channel to 
the olfactory cAMP-gated channel was pointed out The RNA transcript was found to be 3.2 kb 
long in human, mouse, and dog. By PCR used in connection with somatic cell hybrid DNAs, 
Pittler et al. (1992) mapped the CNCG gene to 4pl4-ql3 near the centromere. By interspecific 
backcross haplotype analysis, the corresponding gene in me mouse, Cncg, was mapped to a site 

15 0.9 cM proximal to the Kit locus on chromosome 5. Griffin et al. (1993) mapped the CNCG1 
gene to 4pl2-cen by fluorescence in situ hybridization. It is noteworthy that the rod cGMP PDE 
beta polypeptide (PDEB; 180072) also maps to 4p, at 4pl6.3. 
Uses of the Compositions of the Invention 

The protein similarity information, expression pattern, and map location for MOL1 0 

20 suggest that it may have important structural and/or physiological functions characteristic of the 
Cyclic-nucieotide gated channel family. Therefore, the nucleic acids and proteins of the 
invention are useful in potential diagnostic and therapeutic applications and as a research tool. 
These include serving as a specific or selective nucleic acid or protein diagnostic and/or 
prognostic marker, wherein the presence or amount of the nucleic acid or the protein are to be 

25 assessed, as well as potential therapeutic applications such as the following: (i) a protein 

therapeutic, (ii) a small molecule drug target, (iii) an antibody target (therapeutic, diagnostic, 
drug targeting/cytotoxic antibody), (iv) a nucleic acid useful in gene therapy (gene delivery/gene 
ablation), and (v) a composition promoting tissue regeneration in vitro and in vivo (vi) biological 
defense weapon. 

30 The nucleic acids and proteins of the invention are useful in potential diagnostic and 

therapeutic applications implicated in various diseases and disorders described below and/or 
other pathologies. For example, the compositions of the present invention will have efficacy for 
treatment of patients suffering from: 
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color blindness, CNS developmental disorders and other diseases, disorders and 
conditions of the like. 

These materials are further useful in the generation of antibodies that bind 
immunospecifically to the novel substances of the invention for use in therapeutic or diagnostic 
5 methods. 

A summary of the MOLX nucleic acids and proteins of the invention is provided in Table 

11. 



TABLE 11: Summary Of Nucleic Acids And Proteins Of The Invention 



Name 


1 21UICS 


plnnpa Ttpcrrlntlfin nf Plnmnl/MT 
V^IUllC, ASCdLri|l LiUU VI QUlllUlUg 


1^ UUC1L 

Acid 
SEQID 
NO 


Amtnn 

/YilllilU 

Acid 
SEQID 
NO 


MOLl 


1A, IB, 


MOLl: GM_79960178 


1 


2 


MOL2 


2A.2B 


MOL2: 20466828_EXT1 


3 


4 


MOL3 


3A,3B 


MOL3: 82254077.0.1 


5 


6 


MOL4 


4A.4B, 


MOL4: AC004826 


7 


8 


MOL5 


5A, 5B, 


MOL5: AC025535 


9 


10 


MOL6 


6A.6B 


MOL6a: GM_87760758_A 


11 


12 




6D.6E 


MOL6b: GM_87760758_A_da 


13 


14 


MOL7 


7A.7B 


MOL7: 30675745.0.499 


15 


16 


MOL8 


8A.8B 


MOL8a: 11800699-0-16 


17 


18 




8D.8E 


MOL8b: CG56222-01 


19 


20 


MOL9 


9A.9B 


MOL9a: 1 95067 19_B_EXT 


21 


22 




9D.9E 


MOL9b: 1 95067 19_B_EXT-S773 


23 


24 




9F, 9G 


MOL9c: CG56222-01 


25 


26 


MOL10 


10A, 10B 


MOLlOa GM98960647_A 


27 


28 




10D, 10E 


MOL10bCG54557-02 


29 


30 



10 MOLX Nucleic Acids and Polypeptides 

One aspect of the invention pertains to isolated nucleic acid molecules that encode 
MOLX polypeptides or biologically active portions thereof. Also included in the invention are 
nucleic acid fragments sufficient for use as hybridization probes to identify MOLX-encoding 
nucleic acids (e.g. 9 MOLX mRNAs) and fragments for use as PCR primers for the amplification 
1 5 and/or mutation of MOLX nucleic acid molecules. As used herein, the term "nucleic acid 
molecule" is intended to include DNA molecules (eg., cDNA or genomic DNA), RNA 
molecules (e.g., mRNA), analogs of the DNA or RNA generated using nucleotide analogs, and 
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derivatives, fragments and homologs thereof. The nucleic acid molecule may be single-stranded 
or double-stranded, but preferably is comprised double-stranded DNA. 

An MOLX nucleic acid can encode a mature MOLX polypeptide. As used herein, a 
"mature" form of a polypeptide or protein disclosed in the present invention is the product of a 
5 naturally occurring polypeptide or precursor form or proprotein. The naturally occurring 

polypeptide, precursor or proprotein includes, by way of nonlimiting example, the full-length 
gene product, encoded by the corresponding gene. Alternatively, it may be defined as the 
polypeptide, precursor or proprotein encoded by an ORF described herein. The product 
"mature" form arises, again by way of nonlimiting example, as a result of one or more naturally 

10 occurring processing steps as they may take place within the cell, or host cell, in which the gene 
product arises. Examples of such processing steps leading to a "mature" form of a polypeptide 
or protein include the cleavage of the N-terminal methionine residue encoded by the initiation 
codon of an ORF, or the proteolytic cleavage of a signal peptide or leader sequence. Thus a 
mature form arising from a precursor polypeptide or protein that has residues 1 to N, where 

15 residue 1 is the N-terminal methionine, would have residues 2 through N remaining after 

removal of the N-terminal methionine. Alternatively, a mature form arising from a precursor 
polypeptide or protein having residues 1 to N, in which an N-terminal signal sequence from 
residue 1 to residue M is cleaved, would have the residues from residue M+l to residue N 
remaining. Further as used herein, a "mature" form of a polypeptide or protein may arise from a 

20 step of post-translational modification other than a proteolytic cleavage event. Such additional 
processes include, by way of non-limiting example, glycosylation, myristoylation or 
phosphorylation. In general, a mature polypeptide or protein may result from the operation of 
only One of these processes, or a combination of any of them. 

The term "probes", as utilized herein, refers to nucleic acid sequences of variable length, 

25 preferably between at least about 10 nucleotides (nt), 100 nt, or as many as approximately, e.g. f 
6,000 nt, depending upon the specific use. Probes are used in the detection of identical, similar, 
or complementary nucleic acid sequences. Longer length probes are generally obtained from a 
natural or recombinant source, are highly specific, and much slower to hybridize than shorter- 
length oligomer probes. Probes may be single- or double-stranded and designed to have 

30 specificity in PCR, membrane-based hybridization technologies, or ELISA-like technologies. 

The term "isolated" nucleic acid molecule, as utilized herein, is one, which is separated 
from other nucleic acid molecules which are present in the natural source of the nucleic acid. 
Preferably, an "isolated" nucleic acid is free of sequences which naturally flank the nucleic acid 
(i,e., sequences located at the 5*- and S'-termini of the nucleic acid) in the genomic DNA of the 
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organism from which the nucleic acid is derived. For example, in various embodiments, the 
isolated MOLX nucleic acid molecules can contain less than about 5 kb, 4 kb, 3 kb, 2 kb, 1 kb, 
0.5 kb or 0. 1 kb of nucleotide sequences which naturally flank the nucleic acid molecule in 
genomic DNA of the cell/tissue from which the nucleic acid is derived (e.g., brain, heart, liver, 
5 spleen, etc.). Moreover, an "isolated" nucleic acid molecule, such as a cDNA molecule, can be 
substantially free of other cellular material or culture medium when produced by recombinant 
techniques, or of chemical precursors or other chemicals when chemically synthesized. 

A nucleic acid molecule of the invention, e.g., a nucleic acid molecule having the 
nucleotide sequence of SEQ ID NOSrl, 3, 5, 7, 9, 1 1, 13, 15, 17, 19, 21, 23, 25, 27, and 29, or a 

10 complement of this aforementioned nucleotide sequence, can be isolated using standard 

molecular biology techniques and the sequence information provided herein. Using all or a 
portion of the nucleic acid sequence of SEQ ID NOS:l, 3, 5, 7, 9, 1 1, 13, 15, 17, 19, 21, 23, 25, 
27, and 29 as a hybridization probe, MOLX molecules can be isolated using standard 
hybridization and cloning techniques (e.g., as described in Sambrook, etaL, (eds.), MOLECULAR 

1 5 CLOMNG: A Laboratory MANUAL 2 nd Ed., Cold Spring Harbor Laboratory Press, Cold Spring 
Harbor, NY, 1989; and Ausubel, et al, (eds.), Current Protocols IN Molecular Biology, 
John Wiley & Sons, New York, NY, 1993.) 

A nucleic acid of the invention can be amplified using cDNA, mRNA or alternatively, 
genomic DNA, as a template and appropriate oligonucleotide primers according to standard PCR 

20 amplification techniques. The nucleic acid so amplified can be cloned into an appropriate vector 
and characterized by DNA sequence analysis. Furthermore, oligonucleotides corresponding to 
MOLX nucleotide sequences can be prepared by standard synthetic techniques, e.g., using an 
automated DNA synthesizer. 

As used herein, the term "oligonucleotide" refers to a series of linked nucleotide residues, 

25 which oligonucleotide has a sufficient number of nucleotide bases to be used in a PCR reaction. 
A short oligonucleotide sequence may be based on, or designed from, a genomic or cDNA 
sequence and is used to amplify, confirm, or reveal the presence of an identical, similar or 
complementary DNA or RNA in a particular cell or tissue. Oligonucleotides comprise portions 
of a nucleic acid sequence having about 10 nt, 50 nt, or 100 nt in length, preferably about 15 nt 

30 to 30 nt in length. In one embodiment of the invention, an oligonucleotide comprising a nucleic 
acid molecule less than 100 nt in length would further comprise at least 6 contiguous nucleotides 
of SEQ IDNOS:l, 3, 5, 7, 9, 11, 13, 15, 17, 19, 21, 23, 25, 27, and 29, or a complement thereof. 
Oligonucleotides may be chemically synthesized and may also be used as probes. 
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In another embodiment, an isolated nucleic acid molecule of the invention comprises a 
nucleic acid molecule that is a complement of the nucleotide sequence shown in SEQ ID NOS: 1 , 
3, 5, 7, 9, 11, 13, 15, 17, 19, 21, 23, 25, 27, and 29, or a portion of this nucleotide sequence (e.g. 9 
a fragment that can be used as a probe or primer or a fragment encoding a biologically-active 
5 portion of an MOLX polypeptide). A nucleic acid molecule that is complementary to the 

nucleotide sequence shown in SEQ ID NOS:l, 3, 5, 7, 9, 1 1, 13, 15, 17, 19, 21, 23, 25, 27, and 
29 is one that is sufficiently complementary to the nucleotide sequence shown in SEQ ID 
NOS:l,3,5,7,9, 11, 13, 15, 17, 19, 21, 23, 25, 27, and 29 that it can hydrogen bond with little 
or no mismatches to the nucleotide sequence shown SEQ ID NOS: 1, 3, 5, 7, 9, 1 1, 13, 15, 17, 19, 

10 2 1 , 23, 25, 27, and 29, thereby forming a stable duplex. 

As used herein, the term "complementary" refers to Watson-Crick or Hoogsteen base 
pairing between nucleotides units of a nucleic acid molecule, and the term "binding" means the 
physical or chemical interaction between two polypeptides or compounds or associated 
polypeptides or compounds or combinations thereof. Binding includes ionic, non-ionic, van der 

1 5 Waals, hydrophobic interactions, and the like. A physical interaction can be either direct or 
indirect. Indirect interactions may be through or due to the effects of another polypeptide or 
compound. Direct binding refers to interactions that do not take place through, or due to, the 
effect of another polypeptide or compound, but instead are without other substantial chemical 
intermediates. 

20 Fragments provided herein are defined as sequences of at least 6 (contiguous) nucleic 

acids or at least 4 (contiguous) amino acids, a length sufficient to allow for specific hybridization 
in the case of nucleic acids or for specific recognition of an epitope in the case of amino acids, 
respectively, and are at most some portion less than a full length sequence. Fragments may be 
derived from any contiguous portion of a nucleic acid or amino acid sequence of choice. 

25 Derivatives are nucleic acid sequences or amino acid sequences formed from the native 

compounds either directly or by modification or partial substitution. Analogs are nucleic acid 
sequences or amino acid sequences that have a structure similar to, but not identical to, the native 
compound but differs from it in respect to certain components or side chains. Analogs may be 
synthetic or from a different evolutionary origin and may have a similar or opposite metabolic 

30 activity compared to wild type. Homologs are nucleic acid sequences or amino acid sequences 
of a particular gene that are derived from different species. 

Derivatives and analogs may be full length or other than full length, if the derivative or 
analog contains a modified nucleic acid or amino acid, as described below. Derivatives or 
analogs of the nucleic acids or proteins of the invention include, but are not limited to, molecules 
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comprising regions that are substantially homologous to the nucleic acids or proteins of the 
invention, in various embodiments, by at least about 70%, 80%, or 95% identity (with a 
preferred identity of 80-95%) over a nucleic acid or amino acid sequence of identical size or 
when compared to an aligned sequence in which the alignment is done by a computer homology 
5 program known in the art, or whose encoding nucleic acid is capable of hybridizing to the 
complement of a sequence encoding the aforementioned proteins under stringent, moderately 
stringent, or low stringent conditions. See e.g. Ausubel, et aL, CURRENT PROTOCOLS IN 
MOLECULAR BIOLOGY, John Wiley & Sons, New York, NY, 1993, and below. 

A "homologous nucleic acid sequence" or 'liomologous amino acid sequence," or 

10 variations thereof, refer to sequences characterized by a homology at the nucleotide level or 

amino acid level as discussed above. Homologous nucleotide sequences encode those sequences 
coding for isoforms of MOLX polypeptides. Isoforms can be expressed in different tissues of 
the same organism as a result of, for example, alternative splicing of RNA. Alternatively, 
isoforms can be encoded by different genes. In the invention, homologous nucleotide sequences 

1 5 include nucleotide sequences encoding for an MOLX polypeptide of species other than humans, 
including, but not limited to: vertebrates, and thus can include, e.g., frog, mouse, rat, rabbit, dog, 
cat cow, horse, and other organisms. Homologous nucleotide sequences also include, but are not 
limited to, naturally occurring allelic variations and mutations of the nucleotide sequences set 
forth herein. A homologous nucleotide sequence does not, however, include the exact nucleotide 

20 sequence encoding human MOLX protein. Homologous nucleic acid sequences include those 
nucleic acid sequences that encode conservative amino acid substitutions (see below) in SEQ ID 
NOS:l,3, 5, 7, 9, 11, 13, 15, 17, 19,21,23, 25, 27,and29, as well as a polypeptide possessing 
MOLX biological activity. Various biological activities of the MOLX proteins are described 
below. 

25 An MOLX polypeptide is encoded by the open reading frame ("ORF") of an MOLX 

nucleic acid. An ORF corresponds to a nucleotide sequence that could potentially be translated 
into a polypeptide. A stretch of nucleic acids comprising an ORF is uninterrupted by a stop 
. codon. An ORF that represents the coding sequence for a full protein begins with an ATG 
"start" codon and terminates with one of the three "stop" codons, namely, TAA, TAG, or TGA. 

30 For the purposes of this invention, an ORF may be any part of a coding sequence, with or 

without a start codon, a stop codon, or both. For an ORF to be considered as a good candidate 
for coding for a bona fide cellular protein, a minimum size requirement is often set, e.g., a stretch 
of DNA that would encode a protein of 50 amino acids or more. 
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The nucleotide sequences determined from the cloning of the human MOLX genes 
allows for the generation of probes and primers designed for use in identifying and/or cloning 
MOLX homologues in other cell types, e.g. from other tissues, as well as MOLX homologues 
from other vertebrates. The probe/primer typically comprises substantially purified 
5 oligonucleotide. The oligonucleotide typically comprises a region of nucleotide sequence that 
hybridizes under stringent conditions to at least about 12, 25, 50, 100, 150, 200, 250, 300, 350 or 
400 comecutive sense strand nucleotide sequence of SEQ ID NOS:l, 3, 5, 7, 9, 11, 13, 15, 17, 
19, 21, 23, 25, 27, and 29; or an anti-sense strand nucleotide sequence of SEQ ED NOS:l, 3, 5, 7, 
9, 1 1, 13, 15, 17, 19, 21, 23, 25, 27, and 29; or of a naturally occurring mutant of SEQ ID 

10 NOS:l, 3, 5, 7, 9, 11, 13, 15, 17, 19, 21, 23, 25, 27, and 29. 

Probes based on the human MOLX nucleotide sequences can be used to detect transcripts 
or genomic sequences encoding the same or homologous proteins. In various embodiments, the 
probe further comprises a label group attached thereto, e.g. the label group can be a radioisotope, 
a fluorescent compound, an enzyme, or an enzyme co-factor. Such probes can be used as a part 

15 of a diagnostic test kit for identifying cells or tissues which mis-express an MOLX protein, such 
as by measuring a level of an MOLX-encoding nucleic acid in a sample of cells from a subject 
e.g., detecting MOLX mRNA levels or deterrnining whether a genomic MOLX gene has been 
mutated or deleted. 

"A polypeptide having a biologically-active portion of an MOLX polypeptide" refers to 
20 polypeptides exhibiting activity similar, but not necessarily identical to, an activity of a 

polypeptide of the invention, including mature forms, as measured in a particular biological 
assay, with or without dose dependency. A nucleic acid fragment encoding a "biologically- 
active portion of MOLX" can be prepared by isolating a portion SEQ ID NOS:l, 3, 5, 7, 9, 11, 
13, 15, 17, 19, 21, 23, 25, 27, and 29 that encodes a polypeptide having an MOLX biological 
25 activity (the biological activities of the MOLX proteins are described below), expressing the 
encoded portion of MOLX protein {e.g. , by recombinant expression in vitro) and assessing the 
activity of the encoded portion of MOLX. 

MOLX Nucleic Acid and Polypeptide Variants 

The invention further encompasses nucleic acid molecules that differ from the nucleotide 
30 sequences shown SEQ ID NOS:l, 3, 5, 7, 9, 1 1, 13, 15, 17, 19, 21, 23, 25, 27, and 29 due to 

degeneracy of the genetic code and thus encode the same MOLX proteins as that encoded by the 
nucleotide sequences shown in SEQ ID NOS:l, 3, 5, 7, 9, 11, 13, 15, 17, 19, 21, 23, 25, 27, and 
29. In another embodiment, an isolated nucleic acid molecule of the invention has a nucleotide 
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sequence encoding a protein having an amino acid sequence shown in SEQ ID NOS:2, 4, 6, 8, 

10. 12. 14. 16, 18, 20, 22, 24, 26, 28, and 30. 

In addition to the human MOLX nucleotide sequences shown in SEQ ID NOS: 1, 3, 5, 7, 
9, 1 1, 13, 15, 17, 19, 21, 23, 25, 27, and 29 it will be appreciated by those skilled in the art that 
5 DNA sequence polymorphisms that lead to changes in the amino acid sequences of the MOLX 
polypeptides may exist within a population (e.g., the human population). Such genetic 
polymorphism in the MOLX genes may exist among individuals within a population due to 
natural allelic variation. As used herein, the terms "gene" and "recombinant gene" refer to 
nucleic acid molecules comprising an open reading frame (ORF) encoding an MOLX protein, 

10 preferably a vertebrate MOLX protein. Such natural allelic variations can typically result in 
1-5% variance in the nucleotide sequence of the MOLX genes. Any and all such nucleotide 
variations and resulting amino acid polymorphisms in the MOLX polypeptides, which are the 
result of natural allelic variation and that do not alter the functional activity of the MOLX 
polypeptides, are intended to be within the scope of the invention. 

1 5 Moreover, nucleic acid molecules encoding MOLX proteins from other species, and thus 

that have a nucleotide sequence that differs from the human sequence SEQ ID NOS: 1, 3, 5, 7, 9, 

1 1. 13. 15. 17, 19, 21, 23, 25, 27, and 29 are intended to be within the scope of the invention. 
Nucleic acid molecules corresponding to natural allelic variants and homologues of the MOLX 
cDNAs of the invention can be isolated based on their homology to the human MOLX nucleic 

20 acids disclosed herein using the human cDNAs, or a portion thereof, as a hybridization probe 
according to standard hybridization techniques under stringent hybridization conditions. 

Accordingly, in another embodiment, an isolated nucleic acid molecule of the invention 
is at least 6 nucleotides in length and hybridizes under stringent conditions to the nucleic acid 
molecule comprising the nucleotide sequence of SEQ ID NOS:l, 3, 5, 7, 9, 1 1, 13, 15, 17, 19, 

25 21, 23, 25, 27, and 29. In another embodiment, the nucleic acid is at least 10, 25, 50, 100, 250, 
500, 750, 1000, 1500, or 2000 or more nucleotides in length. In yet another embodiment, an 
isolated nucleic acid molecule of the invention hybridizes to the coding region. As used herein, 
the term "hybridizes under stringent conditions" is intended to describe conditions for 
hybridization and washing under which nucleotide sequences at least 60% homologous to each 

30 other typically remain hybridized to each other. 

Homologs (i.e., nucleic acids encoding MOLX proteins derived from species other than 
human) or other related sequences (e.g., paralogs) can be obtained by low, moderate or high 
stringency hybridization with all or a portion of the particular human sequence as a probe using 
methods well known in the art for nucleic acid hybridization and cloning. 
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As used herein, the phrase "stringent hybridization conditions" refers to conditions under 
which a probe, primer or oligonucleotide will hybridize to its target sequence, but to no other 
sequences. Stringent conditions are sequence-dependent and will be different in different 
circumstances. Longer sequences hybridize specifically at higher temperatures than shorter 
5 sequences. Generally, stringent conditions are selected to be about 5 °C lower than the thermal 
melting point (Tm) for the specific sequence at a defined ionic strength and pH. The Tin is the 
temperature (under defined ionic strength, pH and nucleic acid concentration) at which 50% of 
the probes complementary to the target sequence hybridize to the target sequence at equilibrium. 
Since the target sequences are generally present at excess, at Tm, 50% of the probes are occupied 

1 0 at equilibrium. Typically, stringent conditions will be those in which the salt concentration is 
less than about 1 .0 M sodium ion, typically about 0.01 to 1 .0 M sodium ion (or other salts) at 
pH 7.0 to 8.3 and the temperature is at least about 30°C for short probes, primers or 
oligonucleotides (e.g., 10 nt to 50 nt) and at least about 60°C for longer probes, primers and 
oligonucleotides. Stringent conditions may also be achieved with the addition of destabilizing 

1 5 agents, such as formamide: 

Stringent conditions are known to those skilled in the art and can be found in Ausubel, et 
a/., (eds.), Current Protocols in Molecular Biology, John Wiley & Sons, N.Y. (1989), 
6.3.1-6.3.6. Preferably, the conditions are such that sequences at least about 65%, 70%, 75%, 
85%, 90%, 95%, 98%, or 99% homologous to each other typically remain hybridized to each 

20 other. A non-limiting example of stringent hybridization conditions are hybridization in a high 
salt buffer comprising 6X SSC, 50 mM Tris-HCl (pH 7.5), 1 mM EDTA, 0.02% PVP, 0.02% 
Ficoll, 0.02% BSA, and 500 mg/ml denatured salmon sperm DNA at 65°C, followed by one or 
more washes in 0.2X SSC, 0.01% BSA at 50°C. An isolated nucleic acid molecule of the 
invention that hybridizes under stringent conditions to the sequences of SEQ ID NOS: 1, 3, 5, 7, 

25 9, 11, 13, 15, 17, 19, 21,23,25, 27, and 29 corresponds to a naturally-occurring nucleic acid 
molecule. As used herein, a "naturally-occurring" nucleic acid molecule refers to an RNAor 
DNA molecule having a nucleotide sequence that occurs in nature (e.g., encodes a natural 
protein). 

In a second embodiment, a nucleic acid sequence that is hybridizable to the nucleic acid 
30 molecule comprising the nucleotide sequence of SEQ ID NOSrl, 3, 5, 7, 9, 1 1, 13, 15, 17, 19, 
21, 23, 25, 27, and 29 or fragments, analogs or derivatives thereof, under conditions of moderate 
stringency is provided. A non-limiting example of moderate stringency hybridization conditions 
are hybridization in 6X SSC, 5X Denhardfs solution, 0.5% SDS and 100 mg/ml denatured 
salmon sperm DNA at 55°C, followed by one or more washes in IX SSC, 0. 1% SDS at 37°C. 
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Other conditions of moderate stringency that may be used are well-known within the art. See, 
e.g., Ausubel, et al (eds.), 1993, CURRENT PROTOCOLS IN MOLECULAR BIOLOGY, John Wiley & 
Sons, NY, and Kriegler, 1 990; GENE TRANSFER AND EXPRESSION, A LABORATORY MANUAL, 
Stockton Press, NY. 

5 In a third embodiment, a nucleic acid that is hybridizable to the nucleic acid molecule 

comprising the nucleotide sequences of SEQ ID NOSrl, 3, 5, 7, 9, 1 1, 13, 15, 17, 19, 21, 23, 25, 
27, and 29 or fragments, analogs or derivatives thereof, under conditions of low stringency, is 
provided. A non-limiting example of low stringency hybridization conditions are hybridization 
in 35% formamide, 5X SSC, 50 mM Tris-HCl (pH 7.5), 5 mM EDTA, 0.02% PVP, 0.02% 

10 Ficoll, 0.2% BSA, 100 mg/ml denatured salmon sperm DNA, 10% (wt/vol) dextran sulfate at 

40°C, followed by one or more washes in 2X SSC, 25 mM Tris-HCl (pH 7.4), 5 mM EDTA, and 
0. 1% SDS at 50°C. Other conditions of low stringency that may be used are well known in the 
art (e.g., as employed for cross-species hybridizations). See, Ausubel, et al (eds.), 1993, 
CURRENT PROTOCOLS IN Molecular Biology, John Wiley & Sons, NY, and Kriegler, 1990, 

1 5 Gene Transfer and Expression, A Laboratory Manual, Stockton Press, NY; Shilo and 
Weinberg, 1981. Proc Natl Acad Sci USA 78: 6789-6792. 

Conservative Mutations 

In addition to naturally-occurring allelic variants of MOLX sequences that may exist in 
20 the population, the skilled artisan will further appreciate that changes can be introduced by 

mutation into the nucleotide sequences of SEQ ID NOS:l, 3, 5, 7, 9, 1 1, 13, 15, 17, 19, 21, 23, 
25, 27, and 29 thereby leading to changes in the amino acid sequences of the encoded MOLX 
proteins, without altering the functional ability of said MOLX proteins. For example, nucleotide 
substitutions leading to amino acid substitutions at "non-essential" amino acid residues can be 
25 made in the sequence of SEQ ED NOS:2, 4, 6, 8, 10, 12, 14, 16, 18, 20, 22, 24, 26, 28, and 30. A 
"non-essential" amino acid residue is a residue that can be altered from the wild-type sequences 
of the MOLX proteins without altering their biological activity, whereas an "essential" amino 
acid residue is required for such biological activity. For example, amino acid residues that are 
conserved among the MOLX proteins of the invention are predicted to be particularly non- 
30 amenable to alteration. Amino acids for which conservative substitutions can be made are well- 
known within the art 

Another aspect of the invention pertains to nucleic acid molecules encoding MOLX 
proteins that contain changes in amino acid residues that are not essential for activity. Such 
MOLX proteins differ in amino acid sequence from SEQ ID NOS:2, 4, 6, 8, 10, 12, 14, 16, 18, 
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20, 22, 24, 26, 28, and 30 yet retain biological activity. In one embodiment, the isolated nucleic 
acid molecule comprises a nucleotide sequence encoding a protein, wherein the protein 
comprises an amino acid sequence at least about 45% homologous to the amino acid sequences 
of SEQ ID NOS:2, 4, 6, 8, 10, 12, 14, 16, 18, 20, 22, 24, 26, 28, and 30. Preferably, the protein 
5 encoded by the nucleic acid molecule is at least about 60% homologous to SEQ ID NOS:2, 4, 6, 
8, 10, 12, 14, 16, 18, 20, 22, 24, 26, 28, and 30; more preferably at least about 70% homologous 
to SEQ ID NOS:2, 4, 6, 8, 10, 12, 14, 16, 18, 20, 22, 24, 26, 28, and 30; still more preferably at 
least about 80% homologous to SEQ ID NOS:2, 4, 6, 8, 10, 12, 14, 16, 18, 20, 22, 24, 26, 28, 
and 30; even more preferably at least about 90% homologous to SEQ ID NOS:2, 4, 6, 8, 10, 12, 

10 14, 16, 18, 20, 22, 24, 26, 28, and 30; and most preferably at least about 95% homologous to 
SEQ ID NOS:2, 4, 6, 8, 10, 12, 14, 16, 18, 20, 22, 24, 26, 28, and 30. 

An isolated nucleic acid molecule encoding an MOLX protein homologous to the protein 
of SEQ ID NOS:2, 4, 6, 8, 10, 12, 14, 16, 18, 20, 22, 24, 26, 28, and 30 can be created by 
introducing one or more nucleotide substitutions, additions or deletions into the nucleotide 

15 sequence of SEQ ID NOS:l, 3, 5, 7, 9, 1 1, 13, 15, 17, 19, 21, 23, 25, 27, and 29 such that one or 
more amino acid substitutions, additions or deletions are introduced into the encoded protein. 

Mutations can be introduced into SEQ ID NOS:2, 4, 6, 8, 10, 12, 14, 16, 18, 20, 22, 24, 
26, 28, and 30 by standard techniques, such as site-directed mutagenesis and PCR-mediated 
mutagenesis. Preferably, conservative amino acid substitutions are made at one or more 

20 predicted, non-essential amino acid residues. A "conservative amino acid substitution" is one in 
which the amino acid residue is replaced with an amino acid residue having a similar side chain. 
Families of amino acid residues having similar side chains have been defined within the art 
These families include amino acids with basic side chains (e.g., lysine, arginine, histidine), acidic 
side chains (e.g., aspartic acid, glutamic acid), uncharged polar side chains {e.g., glycine, 

25 asparagine, glutarnine, serine, threonine, tyrosine, cysteine), nonpolar side chains (e.g., alanine, 
valine, leucine, isoleucine, proline, phenylalanine, methionine, tryptophan), beta-branched side 
chains (e.g., threonine, valine, isoleucine) and aromatic side chains (e.g., tyrosine, phenylalanine, 
tryptophan, histidine). Thus, a predicted non-essential amino acid residue in the MOLX protein 
is replaced with another amino acid residue from the same side chain family. Alternatively, in 

30 another embodiment, mutations can be introduced randomly along all or part of an MOLX 

coding sequence, such as by saturation mutagenesis, and the resultant mutants can be screened 
for MOLX biological activity to identify mutants that retain activity. Following mutagenesis of 
SEQ IDNOSil, 3, 5, 7, 9, 11, 13, 15, 17, 19, 21, 23, 25, 27, and 29, the encoded protein can be 
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expressed by any recombinant technology known in the art and the activity of the protein can be 
determined 

The relatedness of amino acid families may also be determined based on side chain 
interactions. Substituted amino acids may be fully conserved "strong" residues or fully 
5 conserved "weak" residues. The "strong" group of conserved amino acid residues may be any 
one of the following groups: STA, NEQK, NHQK, NDEQ, QHRK, MELV, MTLF, HY, FYW, 
wherein the single letter amino acid codes are grouped by those amino acids that may be 
substituted for each other. Likewise, the "weak" group of conserved residues may be any one of 
the following: CSA, ATV, SAG, STNX, STPA, SGND, SNDEQK, NDEQHK, NEQHRK, 
10 VLIM, HFY, wherein the letters within each group represent the single letter amino acid code. 

In one embodiment, a mutant MOLX protein can be assayed for (f) the ability to form 
protein:protein interactions with other MOLX proteins, other cell-surface proteins, or 
biologically-active portions thereof, (w) complex formation between a mutant MOLX protein 
and an MOLX ligand; or (Hi) the ability of a mutant MOLX protein to bind to an intracellular 
1 5 target protein or biologically-active portion thereof, (e.g. avidin proteins). 

In yet another embodiment, a mutant MOLX protein can be assayed for the ability to 
regulate a specific biological function (e.g., regulation of insulin release). 

Antisense Nucleic Acids 

Another aspect of the invention pertains to isolated antisense nucleic acid molecules that 
20 are hybridizable to or complementary to the nucleic acid molecule comprising the nucleotide 
sequence of SEQ ID NOS:l, 3, 5, 7, 9, 11, 13, 15, 17, 19, 21, 23, 25, 27, and 29, or fragments, 
analogs or derivatives thereof. An "antisense" nucleic acid comprises a nucleotide sequence that 
is complementary to a "sense" nucleic acid encoding a protein complementary to the 
coding strand of a double-stranded cDNA molecule or complementary to an mRNA sequence). 
25 In specific aspects, antisense nucleic acid molecules are provided that comprise a sequence 
complementary to at least about 10, 25, 50, 100, 250 or 500 nucleotides or an entire MOLX 
coding strand, or to only a portion thereof. Nucleic acid molecules encoding fragments, 
homologs, derivatives and analogs of an MOLX protein of SEQ ID NOS:2, 4, 6, 8, 10,12, 14, 
16, 18, 20, 22, 24, 26, 28, and 30, or antisense nucleic acids complementary to an MOLX nucleic 
30 acid sequence of SEQ ID NOS:l, 3, 5, 7, 9, 1 1, 13, 15, 17, 19, 21, 23, 25, 27, and 29, are 
additionally provided. 

In one embodiment, an antisense nucleic acid molecule is antisense to a "coding region" 
of the coding strand of a nucleotide sequence encoding an MOLX protein. The term "coding 
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region" refers to the region of the nucleotide sequence comprising codons which are translated 
into amino acid residues. In another embodiment, the antisense nucleic acid molecule is 
antisense to a "noncoding region" of the coding strand of a nucleotide sequence encoding the 
MOLX protein. The term "noncoding region" refers to 5' and 3' sequences which flank the 
5 coding region that are not translated into amino acids (i.e., also referred to as 5 1 and 3' 
untranslated regions). 

Given the coding strand sequences encoding the MOLX protein disclosed herein, 
antisense nucleic acids of the invention can be designed according to the rules of Watson and 
Crick or Hoogsteen base pairing. The antisense nucleic acid molecule can be complementary to 

1 0 the entire coding region of MOLX mRNA, but more preferably is an oligonucleotide that is 
antisense to only a portion of the coding or noncoding region of MOLX mRNA. For example, 
the antisense oligonucleotide can be complementary to the region surrounding the translation 
start site of MOLX mRNA. An antisense oligonucleotide can be, for example, about 5, 10, 15, 
20, 25, 30, 35, 40, 45 or 50 nucleotides in length. An antisense nucleic acid of the invention can 

15 be constructed using chemical synthesis or enzymatic ligation reactions using procedures known 
in the art. For example, an antisense nucleic acid (e.g., an antisense oligonucleotide) can be 
chemically synthesized using naturally-occurring nucleotides or variously modified nucleotides 
designed to increase the biological stability of the molecules or to increase the physical stability 
of the duplex formed between the antisense and sense nucleic acids (e.g., phosphorothioate 

20 derivatives and acridine substituted nucleotides can be used). 

Examples of modified nucleotides that can be used to generate the antisense nucleic acid 
include: 5-fluorouracil, 5-bromouracil, 5-chlorouracil, 5-iodouracil, hypoxanthine, xanthine, 
4-acetylcytosine, 5-(carboxyhydroxylmethyl) uracil, 5-carboxymethylaminomethyl- 
2-thiouridine, 5-carboxymethylaminomethyluracil, dihydrouracil, beta-D-galactosylqueosine, 

25 inosine, N6-isopentenyladenine, 1-methylguanine, 1-methylinosine, 2,2-dimethylguanine, 
2-methyladenine, 2-methylguanine, 3-methylcytosine, 5-methylcytosine, N6-adenine, 
7-methylguanine, 5-methylaminomethyluracil, 5-methoxyaminomethyl-2-thiouracil, 
beta-D-mannosylqueosine, 5 f -methoxycarboxymethyluracil, 5-methoxyuracil, 

2- methylthio-N6-isopentenyladenine, uracil-5-oxyacetic acid (v), wybutoxosine, pseudouracil, 
30 queosine, 2-thiocytosme, 5-methyl-2-thiouracil, 2-thiouracil, 4-thiouracil, 5-methyluracil, 

uracil-5 -oxyacetic acid methylester, uracil-5-oxyacetic acid (v), 5-methyl-2-thiouracil, 

3 - (3 -amino-3 -N-2-carb oxypropyl) uracil, (acp3)w, and 2,6-diaminopurine. Alternatively, the 
antisense nucleic acid can be produced biologically using an expression vector into which a 
nucleic acid has been subcloned in an antisense orientation (ie. 9 RNA transcribed from the 
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inserted nucleic acid will be of an antisense orientation to a target nucleic acid of interest, 
described further in the following subsection). 

The antisense nucleic acid molecules of the invention are typically administered to a 
subject or generated in situ such that they hybridize with or bind to cellular mRNA and/or 
5 genomic DNA encoding an MOLX protein to thereby inhibit expression of the protein (e.g. , by 
inhibiting transcription and/or translation). The hybridization can be by conventional nucleotide 
complementarity to form a stable duplex, or, for example, in the case of an antisense nucleic acid 
molecule that binds to DNA duplexes, through specific interactions in the major groove of the 
double helix. An example of a route of administration of antisense nucleic acid molecules of the 

10 invention includes direct injection at a tissue site. Alternatively, antisense nucleic acid 

molecules can be modified to target selected cells and then administered systemically. For 
example, for systemic administration, antisense molecules can be modified such that they 
specifically bind to receptors or antigens expressed on a selected cell surface (e.g., by linking the 
antisense nucleic acid molecules to peptides or antibodies that bind to cell surface receptors or 

1 5 antigens). The antisense nucleic acid molecules can also be delivered to cells using the vectors 
described herein. To achieve sufficient nucleic acid molecules, vector constructs in which the 
antisense nucleic acid molecule is placed under the control of a strong pol II or pol IE promoter 
are preferred. 

In yet another embodiment, the antisense nucleic acid molecule of the invention is an 
20 a-anomeric nucleic acid molecule. An oc-anomeric nucleic acid molecule forms specific 

double-stranded hybrids with complementary RNA in which, contrary to the usual p-units, the 
strands run parallel to each other. See, e.g., Gaultier, et aL, 1987. Nucl. Acids Res. 15: 
6625-6641. The antisense nucleic acid molecule can also comprise a 2 , -o-methylribonucleotide 
(see, e.g., Inoue, et al. 1987. Nucl Acids Res. 15: 6131-6148) or a chimeric KNA-DNA analogue 
25 (see, e.g. f Inoue, et al. 9 1987. FEBSLett 215: 327-330. 

Ribozymes and PNA Moieties 

Nucleic acid modifications include, by way of non-limiting example, modified bases, and 
nucleic acids whose sugar phosphate backbones are modified or derivatized. These 
30 modifications are carried out at least in part to enhance the chemical stability of the modified 
nucleic acid, such that they may be used, for example, as antisense binding nucleic acids in 
therapeutic applications in a subject. 

In one embodiment, an antisense nucleic acid of the invention is a ribozyme. Ribozymes 
are catalytic RNA molecules with ribonuclease activity that are capable of cleaving a 

96 



WO 01/81578 



PCT/US01/13578 



single-stranded nucleic acid, such as an mRNA, to which they have a complementary region. 
Thus, ribozymes (e.g., hammerhead ribozymes as described in Haselhoff and Gerlach 1988. 
Nature 334: 585-591) can be used to catalytically cleave MOLX mRNA transcripts to thereby 
inhibit translation of MOLX mRNA. A ribozyme having specificity for an MOLX-encoding 
5 nucleic acid can be designed based upon the nucleotide sequence of an MOLX cDNA disclosed 
herein (i.e., SEQ ED NOS:l, 3, 5, 7, 9, 1 1, 13, 15, 17, 19, 21, 23, 25, 27, and 29). For example, a 
derivative of a Tetrahymena L-19 IVS RNA can be constructed in which me nucleotide sequence 
of the active site is complementary to the nucleotide sequence to be cleaved in an 
. MOLX-encoding mRNA. See, e.g., U.S. Patent 4,987,071 to Cech, et al and U.S. Patent 

10 5,11 6,742 to Cech, et al MOLX mRNA can also be used to select a catalytic RNA having a 
specific ribonuclease activity from a pool of RNA molecules. See, e.g., Bartel etal, (1993) 
Science 261:1411-1418. 

Alternatively, MOLX gene expression can be inhibited by targeting nucleotide sequences 
complementary to the regulatory region of the MOLX nucleic acid (e.g. , the MOLX promoter 

15 and/or enhancers) to form triple helical structures that prevent transcription of the MOLX gene in 
target cells. See, e.g., Helene, 1991. Anticancer Drug Des. 6: 569-84; Helene, et al 1992. Ann. 
N.Y. Acad. ScL 660: 27-36; Maher, 1992. Bioassays 14: 807-15. 

In various embodiments, the MOLX nucleic acids can be modified at the base moiety, 
sugar moiety or phosphate backbone to improve, e.g. , the stability, hybridization, or solubility of 

20 the molecule. For example, the deoxyribose phosphate backbone of the nucleic acids can be 
modified to generate peptide nucleic acids. See, e.g., Hyrup, et al., 1996. BioorgMed Chem 4: 
5-23. As used herein, the terms peptide nucleic acids" or "PNAs" refer to nucleic acid mimics 
(e.g. , DNA mimics) in which the deoxyribose phosphate backbone is replaced by a 
pseudopeptide backbone and only the four natural nucleobases are retained. The neutral 

25 backbone of PNAs has been shown to allow for specific hybridization to DNA and RNA under 
conditions of low ionic strength. The synthesis of PNA oligomers can be performed using 
standard solid phase peptide synthesis protocols as described in Hyrup, et al, 1996. supra; 
Perry-O'Keefe, et al, 1996. Proc. Natl Acad. Sci. USA 93: 14670-14675. 

PNAs of MOLX can be used in therapeutic and diagnostic applications. For example, 

30 PNAs can be used as antisense or antigene agents for sequence-specific modulation of gene 

expression by, e.g., inducing transcription or translation arrest or inhibiting replication. PNAs of 
MOLX can also be used, for example, in the analysis of single base pair mutations in a gene 
(e.g., PNA directed PCR clamping; as artificial restriction enzymes when used in combination 
with other enzymes, e.g., Si nucleases (see, Hyrup, et al,, \996.supra)' 9 or as probes or primers 
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for DNA sequence and hybridization {see, Hyrup, et aL, 1996, supra; Perry-O l Keefe, et al., 
1996. supra). 

In another embodiment, PNAs of MOLX can be modified, e.g. , to enhance their stability 
or cellular uptake, by attaching lipophilic or other helper groups to PNA, by the formation of 
5 PNA-DNA chimeras, or by the use of liposomes or other techniques of drug delivery known in 
the art. For example, PNA-DNA chimeras of MOLX can be generated that may combine the 
advantageous properties of PNA and DNA. Such chimeras allow DNA recognition enzymes 
(e.g. 9 RNase H and DNA polymerases) to interact with the DNA portion while the PNA portion 
would provide high binding affinity and specificity. PNA-DNA chimeras can be linked using 

10 linkers of appropriate lengths selected in terms of base stacking, number of bonds between the 
nucleobases, and orientation (see, Hyrup, et al., 1996. supra). The synthesis of PNA-DNA 
chimeras can be performed as described in Hyrup, et al., 1996. supra and Finn, et aL, 1996. Nucl 
Acids Res 24: 3357-3363. For example, a DNA chain can be synthesized on a solid support 
using standard phosphoramidite coupling chemistry, and modified nucleoside analogs, e.g., 

1 5 S'^-memoxytriry^amino-S'-deoxy-thymidine phosphoramidite, can be used between the PNA 
and the 5' end of DNA. See, e.g., Mag, etaL, 1989. Nucl Acid Res '11: 5973-5988. PNA 
monomers are then coupled in a stepwise manner to produce a chimeric molecule with a 5' PNA 
segment and a 3 1 DNA segment. See, e.g., Firm, et aL, 1996. supra. Alternatively, chimeric 
molecules can be synthesized with a 5' DNA segment and a 3' PNA segment. See, e.g., Petersen, 

20 et aL, 1975. Bioorg. Med. Chem. Lett 5: 1119-11124. 

In other embodiments, the oligonucleotide may include other appended groups such as 
peptides (e.g., for targeting host cell receptors in vivo), or agents facilitating transport across the 
cell membrane (see, e.g. 9 Letsinger, et aL, 1989. Proc. Natl. Acad. Set U.SA. 86: 6553-6556; 
Lemaitre, et aL, 1987. Proc. NatL Acad. Sci. 84: 648-652; PCT Publication No. WO88/09810) or 

25 the blood-brain barrier (see, e.g., PCT Publication No. WO 89/10134). In addition, 

oligonucleotides can be modified with hybridization triggered cleavage agents (see, e.g., Kxo\, 'et 
aL, 1988. BioTechniques 6:958-976) or intercalating agents (see, e.g., Zon, 1988. Phamt. Res. 5: 
539-549). To this end, the oligonucleotide may be conjugated to another molecule, e.g., a 
peptide, a hybridization triggered cross-linking agent, a transport agent, a hybridization-triggered 

30 cleavage agent, and the like. 

MOLX Polypeptides 

A polypeptide according to the invention includes a polypeptide including the amino acid 
sequence of MOLX polypeptides whose sequences are provided in SEQ ID NOS:2, 4, 6, 8, 10, 
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12, 14, 16, 18, 20, 22, 24, 26, 28, and 30. The invention also includes a mutant or variant protein 
any of whose residues may be changed from the corresponding residues shown in SEQ ID 
NOS:2, 4, 6, 8, 10, 12, 14, 16, 18, 20, 22, 24, 26, 28, and 30 while still encoding a protein that 
maintains its MOLX activities and physiological functions, or a functional fragment thereof. 
5 In general, an MOLX variant that preserves MOLX-like function includes any variant in 

which residues at a particular position in the sequence have been substituted by other amino 
acids, and further include the possibility of inserting an additional residue or residues between 
two residues of the parent protein as well as the possibility of deleting one or more residues from 
the parent sequence. Any amino acid substitution, insertion, or deletion is encompassed by the 
10 invention. In favorable circumstances, the substitution is a conservative substitution as defined 
above. 

One aspect of the invention pertains to isolated MOLX proteins, and biologically-active 
portions thereof, or derivatives, fragments, analogs or homologs thereof. Also provided are 
polypeptide fragments suitable for use as irmnunogens to raise anti-MOLX antibodies. In one 

1 5 embodiment, native MOLX proteins can be isolated from cells or tissue sources by an 

appropriate purification scheme using standard protein purification techniques. In another 
embodiment, MOLX proteins are produced by recombinant DNA techniques. Alternative to 
recombinant expression, an MOLX protein or polypeptide can be synthesized chemically using 
standard peptide synthesis techniques. 

20 An "isolated" or "purified" polypeptide or protein or biologically-active portion thereof is 

substantially free of cellular material or other contaminating proteins from the cell or tissue 
source from which the MOLX protein is derived, or substantially free from chemical precursors 
or other chemicals when chemically synthesized. The language "substantially free of cellular 
material" includes preparations of MOLX proteins in which the protein is separated from cellular 

25 components of the cells from which it is isolated or recombinantly-produced. In one 

embodiment, the language "substantially free of cellular material" includes preparations of 
MOLX proteins having less than about 30% (by dry weight) of non-MOLX proteins (also 
referred to herein as a "contaminating protein"), more preferably less than about 20% of 
non-MOLX proteins, still more preferably less than about 10% of non-MOLX proteins, and most 

30 preferably less than about 5% of non-MOLX proteins. When the MOLX protein or biologically- 
active portion thereof is recombinantly-produced, it is also preferably substantially free of 
culture medium, x.e., culture medium represents less than about 20%, more preferably less than 
about 10%, and most preferably less than about 5% of the volume of the MOLX protein 
preparation. 
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The language "substantially free of chemical precursors or other chemicals" includes 
preparations of MOLX proteins in which the protein is separated from chemical precursors or 
other chemicals that are involved in the synthesis of the protein. In one embodiment, the 
language "substantially free of chemical precursors or other chemicals" includes preparations of 
5 MOLX proteins having less than about 30% (by dry weight) of chemical precursors or 

non-MOLX chemicals, more preferably less than about 20% chemical precursors or non-MOLX 
chemicals, still more preferably less than about 10% chemical precursors or non-MOLX 
chemicals, and most preferably less than about 5% chemical precursors or non-MOLX 
chemicals. 

1 0 Biologically-active portions of MOLX proteins include peptides comprising amino acid 

sequences sufficiently homologous to or derived from the amino acid sequences of the MOLX 
proteins (e.g., the amino acid sequence shown in SEQ ID NOS:2, 4, 6, 8, 10, 12, 14, 16, 18, 20, 
22, 24, 26, 28, and 30) that include fewer amino acids than the full-length MOLX proteins, and 
exhibit at least one activity of an MOLX protein. Typically, biologically-active portions 

1 5 comprise a domain or motif with at least one activity of the MOIX protein. A biologically- 
active portion of an MOLX protein can be a polypeptide which is, for example, 10, 25, 50, 100 
or more amino acid residues in length. 

Moreover, other biologically-active portions, in which other regions of the protein are 
deleted, can be prepared by recombinant techniques and evaluated for one or more of the 

20 functional activities of a native MOLX protein. 

In an embodiment, the MOLX protein has an amino acid sequence shown in SEQ ID 
NOS:2, 4, 6, 8, 10, 12, 14, 16, 18, 20, 22, 24, 26, 28, and 30. In other embodiments, the MOLX 
protein is substantially homologous to SEQ ID NOS:2, 4, 6, 8, 10, 12, 14, 16, 18, 20, 22, 24, 26, 
28, and 30, and retains the functional activity of the protein of SEQ ID NOS:2, 4, 6, 8, 10, 12, 

25 14, 16, 18, 20, 22, 24, 26, 28, and 30, yet differs in amino acid sequence due to natural allelic 

variation or mutagenesis, as described in detail, below. Accordingly, in another embodiment, the 
MOLX protein is a protein that comprises an amino acid sequence at least about 45% 
homologous to the amino acid sequence SEQ ID NOS:2, 4, 6, 8, 10, 12, 14, 16, 18, 20, 22, 24, 
26, 28, and 30, and retains the functional activity of the MOLX proteins of SEQ ID NOS:2, 4, 6, 

30 8, 10, 12; 14, 16, 18, 20, 22, 24, 26, 28, and 30. 



Determining Homology Between Two or More Sequences 

To determine the percent homology of two amino acid sequences or of two nucleic acids, 
the sequences are aligned for optimal comparison purposes (e.g., gaps can be introduced in the 
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sequence of a first amino acid or nucleic acid sequence for optimal alignment with a second 
amino or nucleic acid sequence). The amino acid residues or nucleotides at corresponding amino 
acid positions or nucleotide positions are then compared. When a position in the first sequence 
is occupied by the same amino acid residue or nucleotide as the corresponding position in the 
5 second sequence, then the molecules are homologous at that position (i.e. , as used herein amino 
acid or nucleic acid "homology" is equivalent to amino acid or nucleic acid "identity"). 

The nucleic acid sequence homology maybe determined as the degree of identity 
between two sequences. The homology may be determined using computer programs known in 
the art, such as GAP software provided in the GCG program package. See, Needleman and 

10 Wunsch, 1970. JMol Biol 48: 443-453. Using GCG GAP software with the following settings 
for nucleic acid sequence comparison: GAP creation penalty of 5.0 and GAP extension penalty 
of 0.3, the coding region of the analogous nucleic acid sequences referred to above exhibits a 
degree of identity preferably of at least 70%, 75%, 80%, 85%, 90%, 95%, 98%, or 99%, with the 
CDS (encoding) part of the DNA sequence shown in SEQ ID NOS:l, 3, 5, 7, 9, 1 1, 13, 15, 17, 

15 19, 21, 23, 25, 27, and 29. 

The term "sequence identity" refers to the degree to which two polynucleotide of 
polypeptide sequences are identical on a residue-by-residue basis over a particular region of 
comparison. The term "percentage of sequence identity" is calculated by comparing two 
optimally aligned sequences over that region of comparison, determining the number of positions 

20 at which the identical nucleic acid base (e.g„ A, T, C, G, U, or I, in the case of nucleic acids) 
occurs in both sequences to yield the number of matched positions, dividing the number of 
matched positions by the total number of positions in Ihe region of comparison (ie., the window 
size), and multiplying the result by 100 to yield the percentage of sequence identity. The term 
"substantial identity" as used herein denotes a characteristic of a polynucleotide sequence, 

25 wherein the polynucleotide comprises a sequence that has at least 80 percent sequence identity, 
preferably at least 85 percent identity and often 90 to 95 percent sequence identity, more usually 
at least 99 percent sequence identity as compared to a reference sequence over a comparison 
region. 

30 Chimeric and Fusion Proteins 

The invention also provides MOLX chimeric or fusion proteins. As used herein, an 
MOLX "chimeric protein" or "fusion protein" comprises an MOLX polypeptide operatively- 
linked to a non-MOLX polypeptide. An "MOLX polypeptide" refers to a polypeptide having an 
amino acid sequence corresponding to an MOLX protein (SEQ ID NOS:2, 4, 6, 8, 10, 12, 14, 16, 
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18, 20, 22, 24, 26, 28, and 30), whereas a "non-MOLX polypeptide" refers to a polypeptide 
having an amino acid sequence corresponding to a protein that is not substantially homologous 
to the MOLX protein, e.g., a protein that is different from the MOLX protein and that is derived 
from the same or a different organism Within an MOLX fusion protein the MOLX polypeptide 
5 can correspond to all or a portion of an MOLX protein. In one embodiment, an MOLX fusion 
protein comprises at least one biologically-active portion of an MOLX protein. In another 
embodiment, an MOLX fusion protein comprises at least two biologically-active portions of an 
MOLX protein. In yet another embodiment, an MOLX fusion protein comprises at least three 
biologically-active portions of an MOLX protein. Within the fusion protein, the term 

10 "operatively-linked" is intended to indicate that the MOLX polypeptide and the non-MOLX 
polypeptide are fused in-frame with one another. The non-MOLX polypeptide can be fused to 
the N-terminus or C-terminus of the MOLX polypeptide. 

In one embodiment, the fusion protein is a GST-MOLX fusion protein in which the 
MOLX sequences are fused to the C-terrninus of the GST (glutathione S-transferase) sequences. 

1 5 Such fusion proteins can facilitate the purification of recombinant MOLX polypeptides. 

In another embodiment, the fusion protein is an MOLX protein containing a heterologous 
signal sequence at its N-terminus. In certain host cells (e.g., mammalian host cells), expression 
and/or secretion of MOLX can be increased through use of a heterologous signal sequence. 
In yet another embodiment, the fusion protein is an MOLX-immunoglobulin fusion 

20 protein in which the MOLX sequences are fused to sequences derived from a member of the 
immunoglobulin protein family. The MOLX-immunoglobulin fusion proteins of the invention 
can be incorporated into pharmaceutical compositions and administered to a subject to inhibit an 
interaction between an MOLX ligand and an MOLX protein on the surface of a cell, to thereby 
suppress MOLX-mediated signal transduction in vivo. The MOLX-immunoglobulin fusion 

25 proteins can be used to affect the bioavailability of an MOLX cognate ligand. Inhibition of the 
MOLX ligand/MOLX interaction may be useful therapeutically for both the treatment of 
proliferative and differentiative disorders, as well as modulating (e.g. promoting or inhibiting) 
cell survival. Moreover, the MOLX-immunoglobulin fusion proteins of the invention can be 
used as immunogens to produce anti-MOLX antibodies in a subject, to purify MOLX lig&nds, 

30 and in screening assays to identify molecules that inhibit the interaction of MOLX with an 
MOLX ligand. 

An MOLX chimeric or fusion protein of the invention can be produced by standard 
recombinant DNA techniques. For example, DNA fragments coding for the different 
polypeptide sequences are ligated together in-frame in accordance with conventional techniques, 
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e.g. , by employing blunt-ended or stagger-ended termini for ligation, restriction enzyme 
digestion to provide for appropriate termini, filling-in of cohesive ends as appropriate, alkaline 
phosphatase treatment to avoid undesirable joining, and enzymatic ligation. In another 
embodiment, the fusion gene can be synthesized by conventional techniques including 
5 automated DNA synthesizers. Alternatively, PCR amplification of gene fragments can be 
carried out using anchor primers that give rise to complementary overhangs between two 
consecutive gene fragments that can subsequently be annealed and reamplified to generate a 
chimeric gene sequence (see, e.g., Ausubel, et al (eds.) CURRENT PROTOCOLS IN MOLECULAR 
BIOLOGY, John Wiley & Sons, 1992). Moreover, many expression vectors are commercially 
10 available that already encode a fusion moiety (e.g. , a GST polypeptide). An MOLX-encoding 
nucleic acid can be cloned into such an expression vector such that the fusion moiety is linked 
in-frame to the MOLX protein. 

MOLX Agonists and Antagonists 

1 5 The invention also pertains to variants of the MOLX proteins that function as either 

MOLX agonists (i.e., mimetics) or as MOLX antagonists. Variants of the MOLX protein can be 
generated by mutagenesis (e.g., discrete point mutation or truncation of the MOLX protein). An 
agonist of the MOLX protein can retain substantially the same, or a subset of, the biological 
activities of the naturally occurring form of the MOLX protein. An antagonist of the MOLX 

20 protein can inhibit one or more of the activities of the naturally occurring form of the MOLX 
protein by, for example, competitively binding to a downstream or upstream member of a 
cellular signaling cascade which includes the MOLX protein. Thus, specific biological effects 
can be elicited by treatment with a variant of limited function. In one embodiment, treatment of 
a subject with a variant having a subset of the biological activities of the naturally occurring 

25 form of the protein has fewer side effects in a subject relative to treatment with the naturally 
occurring form of the MOLX proteins. 

Variants of the MOLX proteins that function as either MOLX agonists (i.e., mimetics) or 
as MOLX antagonists can be identified by screening combinatorial libraries of mutants (e.g., 
truncation mutants) of the MOLX proteins for MOLX protein agonist or antagonist activity. In 

30 one embodiment, a variegated library of MOLX variants is generated by combinatorial 

mutagenesis at the nucleic acid level and is encoded by a variegated gene library. A variegated 
library of MOLX variants can be produced by, for example, enzymatically ligating a mixture of 
synthetic oligonucleotides into gene sequences such that a degenerate set of potential MOLX . 
sequences is expressible as individual polypeptides, or alternatively, as a set of larger fusion 
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proteins (e.g., for phage display) containing the set of MOLX sequences therein. There are a 
variety of methods which can be used to produce libraries of potential MOLX variants from a 
degenerate oligonucleotide sequence. Chemical synthesis of a degenerate gene sequence can be 
performed in an automatic DNA synthesizer, and the synthetic gene then ligated into an 
5 appropriate expression vector. Use of a degenerate set of genes allows for the provision, in one 
mixture, of all of the sequences encoding the desired set of potential MOLX sequences. 
Methods for synthesizing degenerate oligonucleotides are well-known within the art. See, e.g., 
Narang, 1983. Tetrahedron 39: 3; Itakura, et al. 9 1984. Annu. Rev. Biochem. 53: 323; Itakura, et 
al, 1984. Science 198: 1056; Dee, et al> 1983. Nucl Acids Res. 1 1 : 477. 

10 

Polypeptide Libraries 

In addition, libraries of fragments of the MOLX protein coding sequences can be used to 
generate a variegated population of MOLX fragments for screening and subsequent selection of 
variants of an MOLX protein. In one embodiment, a library of coding sequence fragments can 

15 be generated by treating a double stranded PCR fragment of an MOLX coding sequence with a 
nuclease under conditions wherein nicking occurs only about once per molecule, denaturing the 
double stranded DNA, renaturing the DNA to form double-stranded DNA that can include 
sense/antisense pairs from different nicked products, removing single stranded portions from 
reformed duplexes by treatment with Si nuclease, and ligating the resulting fragment library into 

20 an expression vector. By this method, expression libraries can be derived which encodes 
N-terminal and internal fragments of various sizes of the MOLX proteins. 

Various techniques are known in the art for screening gene products of combinatorial 
libraries made by point mutations or truncation, and for screening cDNA libraries for gene 
products having a selected property. Such techniques are adaptable for rapid screening of the 

25 gene libraries generated by the combinatorial mutagenesis of MOLX proteins. The most widely 
used techniques, which are amenable to high throughput analysis, for screening large gene 
libraries typically include cloning the gene library into replicable expression vectors, 
transforming appropriate cells with the resulting library of vectors, and expressing die 
combinatorial genes under conditions in which detection of a desired activity facilitates isolation 

3 0 of the vector encoding the gene whose product was detected. Recursive ensemble mutagenesis 
(REM), a new technique that enhances the frequency of functional mutants in the libraries, can 
be used in combination with the screening assays to identify MOLX variants. See, e.g, Arkin 
and Yourvan, 1992. Proc. Natl Acad. Sci. USA 89: 7811-7815; Delgrave, era/., 1993. Protein 
Engineering 6:327-331. 
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Anti-MOLX Antibodies 

The invention encompasses antibodies and antibody fragments, such as or (F a b>2, that 
bind immunospecifically to any of the MOLX polypeptides of said invention. 

An isolated MOLX protein, or a portion or fragment thereof, can be used as an 
5 immunogen to generate antibodies that bind to MOLX polypeptides using standard techniques 
for polyclonal and monoclonal antibody preparation. The full-length MOLX proteins can be 
used or, alternatively, the invention provides antigenic peptide fragments of MOLX proteins for 
use as immunogens. The antigenic MOLX peptides comprises at least 4 amino acid residues of 
the amino acid sequence shown in SEQ ID NOS:2, 4, 6, 8, 10, 12, 14, 16, 1 8, 20, 22, 24, 26, 28, 

10 and 30 and encompasses an epitope of MOLX such that an antibody raised against the peptide 
forms a specific immune complex with MOLX. Preferably, the antigenic peptide comprises at 
least 6, 8, 10, 1 5, 20, or 30 amino acid residues. Longer antigenic peptides are sometimes 
preferable over shorter antigenic peptides, depending on use and according to methods well 
known to someone skilled in the art 

15 In certain embodiments of the invention, at least one epitope encompassed by the 

antigenic peptide is a region of MOLX that is located on the surface of the protein (e.g., a 
hydrophilic region). As a means for targeting antibody production, hydropathy plots showing 
regions of hydrophilicity and hydrophobicity may be generated by any method well known in the 
art, including, for example, the Kyte Doolittle or the Hopp Woods methods, either with or 

20 without Fourier transformation {see, e.g., Hopp and Woods, 1981. Proc. Nat. Acad Set USA 78: 
3824-3828; Kyte and Doolittle, 1982. J. Mol Biol. 157: 105-142, each incorporated herein by 
reference in their entirety). 

As disclosed herein, MOLX protein sequences of SEQ ID NOS:2, 4, 6, 8, 10, 12, 14, 16, 
1 8, 20, 22, 24, 26, 28, and 30, or derivatives, fragments, analogs or homologs thereof, may be 

25 utilized as immunogens in the generation of antibodies that immunospecifically-bind these 

protein components. The term "antibody" as used herein refers to immunoglobulin molecules 
and immunologically-active portions of immunoglobulin molecules, i.e., molecules that contain 
an antigen binding site that specifically-binds (immunoreacts with) an antigen, such as MOLX. 
Such antibodies include, but are not limited to, polyclonal, monoclonal, chimeric, single chain, 

30 and F(ab«)2 fragments, and an Fab expression library. In a specific embodiment, antibodies to 

human MOLX proteins are disclosed. Various procedures known within the art may be used for 
the production of polyclonal or monoclonal antibodies to an MOLX protein sequence of SEQ ED 
NOS:2, 4, 6, 8, 10, 12, 14, 16, 18, 20, 22, 24, 26, 28, and 30, or a derivative, fragment, analog or 
homolog thereof. Some of these proteins are discussed below. 
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For the production of polyclonal antibodies, various suitable host animals {e.g, rabbit, 
goat, mouse or other mammal) may be immunized by injection with the native protein, or a 
synthetic variant thereof, or a derivative of the foregoing. An appropriate immunogenic 
preparation can contain, for example, recombinantly-expressed MOLX protein or a chemically- 
5 synthesized MOLX polypeptide. The preparation can further include an adjuvant Various 
adjuvants used to increase the immunological response include, but are not limited to, Freund's 
(complete and incomplete), mineral gels {e.g., aluminum hydroxide), surface active substances 
(e.g tJ lysolecithin, pluronic polyols, polyanions, peptides, oil emulsions, dinitrophenol, etc.), 
human adjuvants such as Bacille Calmette-Guerin and Corynebacterium parvum, or similar 

10 irnmunostimulatory agents. If desired, the antibody molecules directed against MOLX can be 
isolated from the mammal {e.g., from the blood) and further purified by well known techniques, 
such as protein A chromatography to obtain the IgG fraction. 

The term "monoclonal antibody" or "monoclonal antibody composition", as used herein, 
refers to a population of antibody molecules that contain only one species of an antigen binding 

1 5 site capable of immunoreacting with a particular epitope of MOLX. A monoclonal antibody 

composition thus typically displays a single binding affinity for a particular MOLX protein with 
which it immunoreacts. For preparation of monoclonal antibodies directed towards a particular 
MOLX protein, or derivatives, fragments, analogs or homologs thereof, any technique that 
provides for the production of antibody molecules by continuous cell line culture may be 

20 utilized. Such techniques include, but are not limited to, the hybridoma technique {see, e.g, 
Kohler & Milstein, 1975. Nature 256: 495-497); the trioma technique; the human B-cell 
hybridoma technique {see, e.g., Kozbor, et al, 1983. Immunol. Today 4: 72) and the EBV 
hybridoma technique to produce human monoclonal antibodies {see, e.g, Cole, et al, 1985. In: 
MONOCLONAL Antibodies AND Cancer THERAPY, Alan R. Liss, Inc., pp. 77-96). Human 

25 monoclonal antibodies may be utilized in the practice of the invention and may be produced by 
using human hybridomas {see, e.g. t Cote, et al, 1983. Proc Natl Acad Sci USA 80: 2026-2030) 
or by transforming human B-cells with Epstein Barr Virus in vitro {see, e.g. t Cole, et al, 1985. 
In: Monoclonal Antibodies and Cancer Therapy, Alan R. Liss, Inc., pp. 77-96). Each of 
the above citations is incorporated herein by reference in their entirety. 

30 According to the invention, techniques can be adapted for the production of single-chain 

antibodies specific to an MOLX protein {see, e.g, U.S. Patent No. 4,946,778). In addition, 
methods can be adapted for the construction of expression libraries {see, e.g., Huse, et al, 
1989. Science 246: 1275-1281) to allow rapid and effective identification of monoclonal 
fragments with the desired specificity for an MOLX protein or derivatives, fragments, analogs or 
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homologs thereof. Non-human antibodies can be "humanized" by techniques well known in the 
art See, e.g., U.S. Patent No. 5,225,539. Antibody fragments that contain the idiotypes to an 
MOLX protein may be produced by techniques known in the art including, but not limited to: (i) 
an F(ab-)2 fragment produced by pepsin digestion of an antibody molecule; (n) an Fab fragment 
5 generated by reducing the disulfide bridges of an F^-p fragment; {Hi) an F^ fragment generated 
by the treatment of the antibody molecule with papain and a reducing agent; and (iv) F v 
fragments. 

Additionally, recombinant anti-MOLX antibodies, such as chimeric and humanized 
monoclonal antibodies, comprising both human and non-human portions, which can be made 

10 using standard recombinant DNA techniques, are within the scope of the invention. Such 
chimeric and humanized monoclonal antibodies can be produced by recombinant DNA 
techniques known in the art, for example using methods described in International Application 
No. PCT/US86/02269; European Patent Application No. 1 84, 1 87; European Patent Application 
No. 171,496; European Patent Application No. 173,494; PCT International Publication No. WO 

15 86/01533; U.S. Patent No. 4,816,567; U.S. Pat. No. 5,225,539; European Patent Application No. 
125,023; Better, et al., 1988. Science 240: 1041-1043; Liu, et al., mi. Proc. Natl Acad. Set. 
USA 84: 3439-3443; Liu, eta!., 1987./. Immunol. 139: 352.1-3526; Sun, etal, 1987. Proc. Natl 
Acad Sci. USA 84: 214-218; Nishimura, etal, 1987. Cancer Res. 47: 999-1005; Wood, etal, 

1985. Nature 314 :446-449; Shaw, et al, 1988. Natl Cancer Inst. 80: 1553-1559); 

20 Morrison(1985) Science 229:1202-1207; Oi, et al. (1986) BioTechniques 4:214; Jones, et al, 

1986. Nature 321: 552-525; Verhoeyan, etal, 1988. Science 239: 1534; andBeidler, et al, 
1988. Immunol 141: 4053-4060. Each of the above citations are incorporated herein by 
reference in their entirety. 

In one embodiment, methods for the screening of antibodies that possess the desired 
25 specificity include, but are not limited to, enzyme-linked immunosorbent assay (ELISA) and 
other immunologically-mediated techniques known within the art. In a specific embodiment, 
selection of antibodies that are specific to a particular domain of an MOLX protein is facilitated 
by generation of hybridomas that bind to the fragment of an MOLX protein possessing such a 
domain. Thus, antibodies that are specific for a desired domain within an MOLX protein, or 
30 derivatives, fragments, analogs or homologs thereof, are also provided herein. 

Anti-MOLX antibodies may be used in methods known within the art relating to the 
localization and/or quantitation of an MOLX protein (e.g., for use in measuring levels of the 
MOLX protein within appropriate physiological samples, for use in diagnostic methods, for use 
in imaging the protein, and the like). In a given embodiment, antibodies for MOLX proteins, or 
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derivatives, fragments, analogs or homologs thereof, that contain the antibody derived binding 
domain, are utilized as phaimacologically-active compounds (hereinafter "Therapeutics 11 ). 

An anti-MOLX antibody (eg., monoclonal antibody) can be used to isolate an MOLX 
polypeptide by standard techniques, such as affinity chromatography or immunoprecipitation. 
5 An anti-MOLX antibody can facilitate the purification of natural MOLX polypeptide from cells 
and of recombinantly-produced MOLX polypeptide expressed in host cells. Moreover, an 
anti-MOLX antibody can be used to detect MOLX protein (e.g., in a cellular lysate or cell 
supernatant) in order to evaluate the abundance and pattern of expression of the MOLX protein. 
Anti-MOLX antibodies can be used diagnostically to monitor protein levels in tissue as part of a 

10 clinical testing procedure, e.g., to, for example, detennine the efficacy of a given treatment 
regimen. Detection can be facilitated by coupling (i.e f physically linking) the antibody to a 
detectable substance. Examples of detectable substances include various enzymes, prosthetic 
groups, fluorescent materials, luminescent materials, bioluminescent materials, and radioactive 
materials. Examples of suitable enzymes include horseradish peroxidase, alkaline phosphatase, 

1 5 P-galactosidase, or acetylcholinesterase; examples of suitable prosthetic group complexes 

include streptavidin/biotin and avidin/biotin; examples of suitable fluorescent materials include 
umbelliferone, fluorescein, fluorescein isothiocyanate, rhodamine, dichlorotriazmylamine 
fluorescein, dansyl chloride or phycoerythrin; an example of a luminescent material includes 
luminol; examples of bioluminescent materials include luciferase, luciferin, and aequorin, and 

20 examples of suitable radioactive material include 125 I, l3I I, 35 S or 3 H. 

MOLX Recombinant Expression Vectors and Host Cells 

Another aspect of the invention pertains to vectors, preferably expression vectors, 
containing a nucleic acid encoding an MOLX protein, or derivatives, fragments, analogs or 

25 homologs thereof. As used herein, the term "vector" refers to a nucleic acid molecule capable of 
transporting another nucleic acid to which it has been linked. One type of vector is a "plasmi^', 
which refers to a circular double stranded DNA loop into which additional DNA segments can 
be ligated. Another type of vector is a viral vector, wherein additional DNA segments can be 
ligated into the viral genome. Certain vectors are capable of autonomous replication in a host 

30 cell into which they are introduced (e.g., bacterial vectors having a bacterial origin of replication 
and episomal mammalian vectors). Other vectors (eg., non-episomal mammalian vectors) are 
integrated into the genome of a host cell upon introduction into the host cell, and thereby are 
replicated along with the host genome. Moreover, certain vectors are capable of directing the 
expression of genes to which they are operatively-linked. Such vectors are referred to herein as 
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"expression vectors". In general, expression vectors of utility in recombinant DNA techniques 
are often in the form of plasmids. In the present specification, "plasmid" and "vector" can be 
used interchangeably as the plasmid is the most commonly used form of vector. However, the 
invention is intended to include such other forms of expression vectors, such as viral vectors 
5 (e.g., replication defective retroviruses, adenoviruses and adeno-associated viruses), which serve 
equivalent functions. 

The recombinant expression vectors of the invention comprise a nucleic acid of the 
invention in a form suitable for expression of the nucleic acid in a host cell, which means that the 
recombinant expression vectors include one or more regulatory sequences, selected on the basis 

10 of the host cells to be used for expression, that is operatively-linked to the nucleic acid sequence 
to be expressed. Within a recombinant expression vector, "operably-linked" is intended to mean 
that the nucleotide sequence of interest is linked to the regulatory sequence(s) in a manner that 
allows for expression of the nucleotide sequence (e.g., in an in vitro transcription/translation 
system or in a host cell when the vector is introduced into the host cell). 

1 5 The term "regulatory sequence" is intended to includes promoters, enhancers and other 

expression control elements (e.g., polyadenylation signals). Such regulatory sequences are 
described, for example, in Goeddel, GENE EXPRESSION Technology: Methods in 
Enzymology 185, Academic Press, San Diego, Calif. (1990). Regulatory sequences include 
those that direct constitutive expression of a nucleotide sequence in many types of host cell and 

20 those that direct expression of the nucleotide sequence only in certain host cells (e.g. , 

tissue-specific regulatory sequences). It will be appreciated by those skilled in the art that the 
design of the expression vector can depend on such factors as the choice of the host cell to be 
transformed, the level of expression of protein desired, etc. The expression vectors of the 
invention can be introduced into host cells to thereby produce proteins or peptides, including 

25 fusion proteins or peptides, encoded by nucleic acids as described herein (e.g. , MOLX proteins, 
mutant forms of MOLX proteins, fusion proteins, etc.). 

The recombinant expression vectors of the invention can be designed for expression of 
MOLX proteins in prokaryotic or eukaryotic cells. For example, MOLX proteins can be 
expressed in bacterial cells such as Escherichia coli, insect cells (using baculovirus expression 

30 vectors) yeast cells or mammalian cells. Suitable host cells are discussed further in Goeddel, 
Gene Expression Technology : Methods in Enzymology 185, Academic Press, San Diego, 
Calif. (1990). Alternatively, the recombinant expression vector can be transcribed and translated 
in vitro, for example using T7 promoter regulatory sequences and 17 polymerase. 
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Expression of proteins in prokaryotes is most often carried out in Escherichia coli with 
vectors containing constitutive or inducible promoters directing the expression of either fusion or 
non-fusion proteins. Fusion vectors add a number of amino acids to a protein encoded therein, 
usually to the amino terminus of the recombinant protein. Such fusion vectors typically serve 
5 three purposes: (J) to increase expression of recombinant protein; (z"0 to increase the solubility of 
the recombinant protein; and (in) to aid in the purification of the recombinant protein by acting 
as a ligand in affinity purification. Often, in fusion expression vectors, a proteolytic cleavage 
site is introduced at the junction of the fusion moiety and the recombinant protein to enable 
separation of the recombinant protein from the fusion moiety subsequent to purification of the 

10 fusion protein. Such enzymes, and their cognate recognition sequences, include Factor Xa, 
thrombin and enterokinase. Typical fusion expression vectors include pGEX (Pharmacia 
Biotech Inc; Smith and Johnson, 1988. Gene 67: 31-40), pMAL (New England Biolabs, Beverly, 
Mass.) and pRIT5 (Pharmacia, Piscataway, NJ.) that fuse glutathione S-transferase (GST), 
maltose E binding protein, or protein A, respectively, to the target recombinant protein. 

15 Examples of suitable inducible non-fusion E. coli expression vectors include pTrc 

(Amrann et al. 9 (1988) Gene 69:301-3 15) and pET 1 Id (Studier et ah, Gene EXPRESSION 
TECHNOLOGY: METHODS IN ENZYMOLOGY 185, Academic Press, San Diego, Calif. (1990) 
60-89). 

One strategy to maximize recombinant protein expression in E. coli is to express the 
20 protein in a host bacteria with an impaired capacity to proteolyticaily cleave the recombinant 

protein. See, e.g., Gottesman, Gene EXPRESSION TECHNOLOGY: METHODS IN ENZYMOLOGY 185, 
Academic Press, San Diego, Calif. (1990) 119-128. Another strategy is to alter the nucleic acid 
sequence of the nucleic acid to be inserted into an expression vector so that the individual codons 
for each amino acid are those preferentially utilized in E. coli (see, e.g., Wada, et a/., 1992. Nucl. 
25 Acids Res. 20:2111-2118). Such alteration of nucleic acid sequences of the invention can be 
carried out by standard DNA synthesis techniques. 

In another embodiment, the MOLX expression vector is a yeast expression vector. 
Examples of vectors for expression in yeast Saccharomyces cerivisae include pYepSecl 
(Baldari, et al, 1987. EMBOJ. 6: 229-234), pMFa (Kurjanand Herskowitz, 1982. Cell 30: 
30 933-943), pJRY88 (Schultz et al> 1987. Gene 54: 1 13-123), pYES2 (Invitrogen Corporation, 
San Diego, Calif.), and picZ (InVitrogen Corp, San Diego, Calif.). 

Alternatively, MOLX can be expressed in insect cells using baculovirus expression 
vectors. Baculovirus vectors available for expression of proteins in cultured insect cells (e.g., 
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SF9 cells) include the pAc series (Smith, etal, 1983. Mol Cell Biol 3: 2156-2165) and the 
pVL series (Lucklow and Summers, 1989. Virology 170: 31-39). 

In yet another embodiment, a nucleic acid of the invention is expressed in mammalian 
cells using a mammalian expression vector. Examples of mammalian expression vectors include 
5 pCDM8 (Seed, 1987. Nature 329: 840) and pMT2PC (Kaufman, et al. % 1987. EMBO J. 6: 
187-195). When used in mammalian cells, the expression vector's control functions are often 
provided by viral regulatory elements. For example, commonly used promoters are derived from 
polyoma, adenovirus 2, cytomegalovirus, and simian virus 40. For other suitable expression 
systems for both prokaryotic and eukaryotic cells see, e.g. 5 Chapters 16 and 17 of Sambrook, et 

10 a/., Molecular Cloning: A Laboratory Manual. 2nd ed., Cold Spring Harbor Laboratory, 
Cold Spring Harbor Laboratory Press, Cold Spring Harbor, N.Y., 1989. 

In another embodiment, the recombinant mammalian expression vector is capable of 
directing expression of the nucleic acid preferentially in a particular cell type (e.g. 9 
tissue-specific regulatory elements are used to express the nucleic acid). Tissue-specific 

1 5 regulatory elements are known in the art. Non-limiting examples of suitable tissue-specific 
promoters include the albumin promoter (liver-specific; Pinkert, et at., 1987. Genes Dev. 1: 
268-277), lymphoid-specific promoters (Calame and Eaton, 1988. Adv. Immunol 43: 235-275), 
in particular promoters ofT cell receptors (Winoto and Baltimore, 19S9.EMBOJ. 8: 729-733) 
and immunoglobulins (Banerji, et a/., 1983. Cell 33: 729-740; Queen and Baltimore, 1983. Cell 

20 33: 741-748), neuron-specific promoters (e.g., the neurofilament promoter; Byrne and Ruddle, 
1989. Proc. Natl Acad. Set USA 86: 5473-5477), pancreas-specific promoters (Edlund, et al 9 
1985. Science 230: 912-916), and mammary gland-specific promoters (e.g, milk whey promoter, 
U.S. Pat. No. 4,873,316 and European Application Publication No. 264,166). Developmentally- 
regulated promoters are also encompassed, e.g., the murine hox promoters (Kessel and Gruss, 

25 1990. Science 249: 374-379) and the a-fetoprotein promoter (Campes and Tilghman, 1989. 
Genes Dev. 3: 537-546). 

The invention further provides a recombinant expression vector comprising a DNA 
molecule of the invention cloned into the expression vector in an antisense orientation. That is, 
the DNA molecule is operatively-linked to a regulatory sequence in a manner that allows for 

30 expression (by transcription of the DNA molecule) of an RNA molecule that is antisense to 
MOLX mRNA. Regulatory sequences operatively linked to a nucleic acid cloned in the 
antisense orientation can be chosen that direct the continuous expression of the antisense RNA 
molecule in a variety of cell types, for instance viral promoters and/or enhancers, or regulatory 
sequences can be chosen that direct constitutive, tissue specific or cell type specific expression of 
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antisense RNA. The antisense expression vector can be in the form of a recombinant plasmid, 
phagemid or attenuated virus in which antisense nucleic acids are produced under the control of 
a high efficiency regulatory region, the activity of which can be determined by the cell type into 
which the vector is introduced. For a discussion of the regulation of gene expression using 
5 antisense genes see, e.g. t Weintraub, et aL, "Antisense RNA as a molecular tool for genetic 
analysis," Reviews-Trends in Genetics, Vol. 1(1) 1986. 

Another aspect of the invention pertains to host cells into which a recombinant 
expression vector of the invention has been introduced. The terms "host cell" and "recombinant 
host cell" are used interchangeably herein. It is understood that such terms refer not only to the 

10 particular subject cell but also to the progeny or potential progeny of such a cell. Because certain 
modifications may occur in succeeding generations due to either mutation or environmental 
influences, such progeny may not, in fact, be identical to the parent cell, but are still included 
within the scope of the term as used herein. 

A host cell can be any prokaryotic or eukaryotic cell. For example, MOLX protein can 

15 be expressed in bacterial cells such as E. coli, insect cells, yeast or mammalian cells (such as 
Chinese hamster ovary cells (CHO) or COS cells). Other suitable host cells are known to those 
skilled in the art. 

Vector DNA can be introduced into prokaryotic or eukaryotic cells via conventional 
transformation or transfection techniques. As used herein, the terms "transformation" and 

20 "transfection" are intended to refer to a variety of art-recognized techniques for introducing 
foreign nucleic acid (e.g., DNA) into a host cell, including calcium phosphate or calcium 
chloride co-precipitation, DEAE-dextran-mediated transfection, lipofection, or electroporation. 
Suitable methods for transforming or transfecting host cells can be found in Sambrook, et aL 
(Molecular Cloning: A Laboratory Manual. 2nd ed., Cold Spring Harbor Laboratory, 

25 Cold Spring Harbor Laboratory Press, Cold Spring Harbor, N.Y., 1989), and other laboratory 
manuals. 

For stable transfection of mammalian cells, it is known that, depending upon the 
expression vector and transfection technique used, only a small fraction of cells may integrate 
the foreign DNA into their genome. In order to identify and select these integrants, a gene that 
30 encodes a selectable marker (e.g. , resistance to antibiotics) is generally introduced into the host 
cells along with the gene of interest. Various selectable markers include those that confer 
resistance to drugs, such as G41 8, hygromycin and methotrexate. Nucleic acid encoding a 
selectable marker can be introduced into a host cell on the same vector as that encoding MOLX 
or can be introduced on a separate vector. Cells stably transfected with the introduced nucleic 
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acid can be identified by drug selection (e.g., cells that have incorporated the selectable marker 
gene will survive, while the other cells die). 

A host cell of the invention, such as a prokaryotic or eukaryotic host cell in culture, can 
be used to produce (i.e. 9 express) MOLX protein. Accordingly, the invention further provides 
5 methods for producing MOLX protein using the host cells of the invention. In one embodiment, 
the method comprises culturing the host cell of invention (into which a recombinant expression 
vector encoding MOLX protein has been introduced) in a suitable medium such that MOLX 
protein is produced. In another embodiment, the method further comprises isolating MOLX 
protein from the medium or the host cell. 

1 0 Transgenic MOLX Animals 

The host cells of the invention can also be used to produce non-human transgenic 
animals. For example, in one embodiment, a host cell of the invention is a fertilized oocyte or an 
embryonic stem cell into which MOLX protein-coding sequences have been introduced. Such 
host cells can then be used to create non-human transgenic animals in which exogenous MOLX 

1 5 sequences have been introduced into their genome or homologous recombinant animals in which 
endogenous MOLX sequences have been altered. Such animals are useful for studying the 
function and/or activity of MOLX protein and for identifying and/or evaluating modulators of 
MOLX protein activity. As used herein, a "transgenic animal" is a non-human animal, 
preferably a mammal, more preferably a rodent such as a rat or mouse, in which one or more of 

20 the cells of the animal includes a transgene. Other examples of transgenic animals include 
non-human primates, sheep, dogs, cows, goats, chickens, amphibians, etc. A transgene is 
exogenous DNA that is integrated into the genome of a cell from which a transgenic animal 
develops and that remains in the genome of the mature animal, thereby directing the expression 
of an encoded gene product in one or more cell types or tissues of the transgenic animal. As 

25 used herein, a "homologous recombinant animal" is a non-human animal, preferably a mammal, 
more preferably a mouse, in which an endogenous MOLX gene has been altered by homologous 
recombination between the endogenous gene and an exogenous DNA molecule introduced into a 
cell of the animal, e.g. t an embryonic cell of the animal, prior to development of the animal. 
A transgenic animal of the invention can be created by introducing MOLX-encoding 

30 nucleic acid into the male pronuclei of a fertilized oocyte (e.g., by microinjection, retroviral 
infection) and allowing the oocyte to develop in a pseudopregnant female foster animal. The 
human MOLX cDNA sequences ofSEQ ID NOS:l, 3, 5, 7, 9, 1 1, 13, 15, 17, 19, 21, 23, 25, 27, 
and 29 can be introduced as a transgene into the genome of a non-human animal. Alternatively, 
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a non-human homologue of the human MOLX gene, such as a mouse MOLX gene, can be 
isolated based on hybridization to the human MOLX cDNA (described further supra) and used 
as a transgene. Intronic sequences and polyadenylation signals can also be included in the 
transgene to increase the efficiency of expression of the transgene. A tissue-specific regulatory 
5 sequence(s) can be operably-linked to the MOLX transgene to direct expression of MOLX 

protein to particular cells. Methods for generating transgenic animals via embryo manipulation 
and microinjection, particularly animals such as mice, have become conventional in the art and 
are described, for example, in U.S. Patent Nos. 4,736,866; 4,870,009; and 4,873,191; and Hogan, 
1986. In: MANIPULATING THE MOUSE EMBRYO, Cold Spring Harbor Laboratory Press, Cold 

10 Spring Harbor, N.Y. Similar methods are used for production of other transgenic animals. A 
transgenic founder animal can be identified based upon the presence of the MOLX transgene in 
its genome and/or expression of MOLX mRNA in tissues or cells of the animals. A transgenic 
founder animal can then be used to breed additional animals carrying the transgene. Moreover, 
transgenic animals carrying a transgene-encoding MOLX protein can further be bred to other 

15 transgenic animals carrying other transgenes. 

To create a homologous recombinant animal, a vector is prepared which contains at least 
a portion of an MOLX gene into which a deletion, addition or substitution has been introduced to 
thereby alter, e.g., functionally disrupt, the MOLX gene. The MOLX gene can be a human gene 
(eg., the cDNA of SEQ ID NOS:l, 3, 5, 7, 9, 11, 13, 15, 17, 19, 21, 23, 25, 27, and 29), but 

20 more preferably, is a non-human homologue of a human MOLX gene. For example, a mouse 
homologue of human MOLX gene of SEQ ID NOS:l, 3, 5, 7, 9, 11, 13, 15, 17, 19, 21, 23, 25, 
27, and 29 can be used to construct a homologous recombination vector suitable for altering an 
endogenous MOLX gene in the mouse genome. In one embodiment, the vector is designed such 
that, upon homologous recombination, the endogenous MOLX gene is functionally disrupted 

25 (/.&, no longer encodes a functional protein; also referred to as a "knock out" vector). 

Alternatively, the vector can be designed such that, upon homologous recombination, the 
endogenous MOLX gene is mutated or otherwise altered but still encodes functional protein 
(e,g. , the upstream regulatory region can be altered to thereby alter the expression of the 
endogenous MOLX protein). In the homologous recombination vector, the altered portion of the 

30 MOLX gene is flanked at its 5 ! - and 3-termini by additional nucleic acid of the MOLX gene to 
allow for homologous recombination to occur between the exogenous MOLX gene carried by 
the vector and an endogenous MOLX gene in an embryonic stem cell. The additional flanking 
MOLX nucleic acid is of sufficient length for successful homologous recombination with the 
endogenous gene. Typically, several kilobases of flanking DNA (both at the 5'- and S'-termini) 
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are included in the vector. See, e.g., Thomas, et al, 1987. Cell 51: 503 for a description of 
homologous recombination vectors. The vector is ten introduced into an embryonic stem cell 
line (e.g., by electroporation) and cells in which the introduced MOLX gene has hornologously- 
recombined with the endogenous MOLX gene are selected. See, e.g., Li, et al., 1992. Cell 69: 
5 915. 

The selected cells are then injected into a blastocyst of an animal {e.g., a mouse) to form 
aggregation chimeras. See. e.g., Bradley, 1987. In: TERATOCARQNOMAS AND EMBRYONIC STEM 
CELLS: A Practical APPROACH, Robertson, ed ERL, Oxford, pp. 1 13-152. A chimeric embiyo 
can then be implanted into a suitable pseudopregnant female foster animal and the embryo 

10 brought to term. Progeny harboring the homologously-recombined DNA in their genn cells can 
be used to breed animals in which all cells of the animal contain the homologously-recombined 
DNA by germline transmission of the transgene. Methods for constructing homologous 
recombination vectors and homologous recombinant animals are described further in Bradley, 
1991. Curr. Opin. Biotechnol 2: 823-829; PCT International Publication Nos.: WO 90/11354; 

15 WO 91/01140; WO 92/0968; and WO 93/04169. 

In another embodiment, transgenic non-humans animals can be produced that contain 
selected systems that allow for regulated expression of the transgene. One example of such a 
system is the cre/loxP recombinase system of bacteriophage PI . For a description of the 
cre/loxP recombinase system, See, e.g., Lakso, etal, 1992. Proc. Natl. Acad Sci. USA 89: 

20 6232-6236. Another example of a recombinase system is the FLP recombinase system of 

SaccJiaronryces cerevisiae. See t O'Gonnan, etal., 1991. Science 251:1351-1355. If acre/loxP 
recombinase system is used to regulate expression of the transgene, animals containing 
transgenes encoding both the Cre recombinase and a selected protein are required. Such animals 
can be provided through the construction of "double" transgenic animals, eg., by mating two 

25 transgenic animals, one containing a transgene encoding a selected protein and the other 
containing a transgene encoding a recombinase. 

Clones of the non-human transgenic animals described herein can also be produced 
according to the methods described in Wilmut, et ah, 1997. Nature 385: 810-813. In brief, a cell 
(e.g., a somatic cell) from the transgenic animal can be isolated and induced to exit the growth 

30 cycle and enter Go phase. The quiescent cell can then be fused, e.g. , through the use of electrical 
pulses, to an enucleated oocyte from an animal of the same species from which the quiescent cell 
is isolated The reconstructed oocyte is then cultured such that it develops to morula or 
blastocyte and then transferred to pseudopregnant female foster animal. The offspring borne of 
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this female foster animal will be a clone of the animal from which the cell {e.g., the somatic cell) 
is isolated. 

Pharmaceutical Compositions 

The MOLX nucleic acid molecules, MOLX proteins, and anti-MOLX antibodies (also 
5 referred to herein as "active compounds") of the invention, and derivatives, fragments, analogs 
and homologs thereof, can be incorporated into pharmaceutical compositions suitable for 
administration. Such compositions typically comprise the nucleic acid molecule, protein, or 
antibody and a pharmaceutically acceptable carrier. As used herein, fl phaimaceutically 
acceptable carrier" is intended to include any and all solvents, dispersion media, coatings, 

10 antibacterial and antifungal agents, isotonic and absorption delaying agents, and the like, 
compatible with pharmaceutical administration. Suitable carriers are described in the most 
recent edition of Remington's Pharmaceutical Sciences, a standard reference text in the field, 
which is incorporated herein by reference. Preferred examples of such carriers or diluents 
include, but are not limited to, water, saline, finger's solutions, dextrose solution, and 5% human 

1 5 seium albumin. Liposomes and non-aqueous vehicles such as fixed oils may also be used. The 
use of such media and agents for pharmaceutically active substances is well known in the art 
Except insofar as any conventional media or agent is incompatible with the active compound, 
use thereof in the compositions is contemplated Supplementary active compounds can also be 
incorporated into the compositions. 

20 A pharmaceutical composition of the invention is formulated to be compatible with its 

intended route of administration. Examples of routes of administration include parenteral, e.g., 
intravenous, intradermal, subcutaneous, oral (e.g., inhalation), transdermal (i.e., topical), 
transmucosal, and rectal administration. Solutions or suspensions used for parenteral, 
intradermal, or subcutaneous application can include the following components: a sterile diluent 

25 such as water for injection, saline solution, fixed oils, polyethylene glycols, glycerine, propylene 
glycol or other synthetic solvents; antibacterial agents such as benzyl alcohol or methyl 
parabens; antioxidants such as ascorbic acid or sodium bisulfite; chelating agents such as 
ethylenediaminetetraacetic acid (EDTA); buffers such as acetates, citrates or phosphates, and 
agents for the adjustment of tonicity such as sodium chloride or dextrose. The pH can be 

30 adjusted with acids or bases, such as hydrochloric acid or sodium hydroxide. The parenteral 
preparation can be enclosed in ampoules, disposable syringes or multiple dose vials made of 
glass or plastic. 
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Pharmaceutical compositions suitable for injectable use include sterile aqueous solutions 
(where water soluble) or dispersions and sterile powders for the extemporaneous preparation of 
sterile injectable solutions or dispersion. For intravenous administration^ suitable carriers 
include physiological saline, bacteriostatic water, Cremophor EL™ (BASF, Parsippany, N. J.) or 
5 phosphate buffered saline (PBS). In all cases, the composition must be sterile and should be 
fluid to the extent that easy syringeability exists. It must be stable under the conditions of 
manufacture and storage and must be preserved against the contaminating action of 
microorganisms such as bacteria and fungi. The carrier can be a solvent or dispersion medium 
containing, for example, water, ethanol, polyol (for example, glycerol, propylene glycol, and 
10 liquid polyethylene glycol, and the like), and suitable mixtures thereof. The proper fluidity can 
be maintained, for example, by the use of a coating such as lecithin, by the maintenance of the 
required particle size in the case of dispersion and by the use of surfactants. Prevention of the 
action of microorganisms can be achieved by various antibacterial and antifungal agents, for 

e 

example, parabens, chlorobutanol, phenol, ascorbic acid, thimerosal, and the like. In many 
1 5 cases, it will be preferable to include isotonic agents, for example, sugars, polyalcohols such as 
manitol, sorbitol, sodium chloride in the composition. Prolonged absorption of the injectable 
compositions can be brought about by including in the composition an agent which delays 
absorption, for example, aluminum monostearate and gelatin. 

Sterile injectable solutions can be prepared by incorporating the active compound (e.g., 
20 an MOLX protein or anti-MOLX antibody) in the required amount in an appropriate solvent with 
one or a combination of ingredients enumerated above, as required, followed by filtered 
sterilization. Generally, dispersions are prepared by incorporating the active compound into a 
sterile vehicle that contains a basic dispersion medium and the required other ingredients from 
those enumerated above. In the case of sterile powders for the preparation of sterile injectable 
25 solutions, methods of preparation are vacuum drying and freeze-drying that yields a powder of 
the active ingredient plus any additional desired ingredient from a previously sterile-filtered 
solution thereof. 

Oral compositions generally include an inert diluent or an edible carrier. They can be 
enclosed in gelatin capsules or compressed into tablets. For the purpose of oral therapeutic 
30 administration, the active compound can be incorporated with excipients and used in the form of 
tablets, troches, or capsules. Oral compositions can also be prepared using a fluid carrier for use 
as a mouthwash, wherein the compound in the fluid carrier is applied orally and swished and 
expectorated or swallowed. Pharmaceutical^ compatible binding agents, and/or adjuvant 
materials can be included as part of the composition. The tablets, pills, capsules, troches and the 
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like can contain any of the following ingredients, or compounds of a similar nature: a binder 
such as microcrystalline cellulose, gum tragacanth or gelatin; an excipient such as starch or 
lactose, a disintegrating agent such as algmic acid, Primogel, or corn starch; a lubricant such as 
magnesium stearate or Sterotes; a glidant such as colloidal silicon dioxide; a sweetening agent 
5 such as sucrose or saccharin; or a flavoring agent such as peppermint, methyl salicylate, or 
orange flavoring. 

For administration by inhalation, the compounds are delivered in the form of an aerosol 
spray from pressured container or dispenser which contains a suitable propellant, e.g. 9 a gas such 
as carbon dioxide, or a nebulizer. 

1 0 Systemic administration can also be by transmucosal or transdermal means. For 

transmucosal or transdermal administration, penetrants appropriate to the barrier to be permeated 
are used in the formulation. Such penetrants are generally known in the art, and include, for 
example, for transmucosal administration, detergents, bile salts, and fusidic acid derivatives. 
Transmucosal administration can be accomplished through the use of nasal sprays or 

15 suppositories. For transdermal administration, the active compounds are formulated into 
ointments, salves, gels, or creams as generally known in the art 

The compounds can also be prepared in the form of suppositories (e.g., with conventional 
suppository bases such as cocoa butter and other glycerides) or retention enemas for rectal 
delivery. 

20 In one embodiment, the active compounds are prepared with carriers that will protect the 

compound against Tapid elimination from the body, such as a controlled release formulation, 
including implants and microencapsulated delivery systems. Biodegradable, biocompatible 
polymers can be used, such as ethylene vinyl acetate, polyanhydrides, polyglycolic acid, 
collagen, polyorthoesters, and polylactic acid. Methods for preparation of such formulations will 

25 be apparent to those skilled in the art. The materials can also be obtained commercially from 
Alza Corporation and Nova Pharmaceuticals, Inc. Liposomal suspensions (including liposomes 
targeted to infected cells with monoclonal antibodies to viral antigens) can also be used as 
phannaceutically acceptable carriers. These can be prepared according to methods known to 
those skilled in the art, for example, as described in U.S . Patent No. 4,522,8 1 1 . 

30 It is especially advantageous to formulate oral or parenteral compositions in dosage unit 

form for ease of administration and uniformity of dosage. Dosage unit form as used herein refers 
to physically discrete units suited as unitary dosages for the subject to be treated; each unit 
containing a predetermined quantity of active compound calculated to produce the desired 
therapeutic effect in association with the required pharmaceutical carrier. The specification for 
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the dosage unit forms of the invention are dictated by and directly dependent on the unique 
characteristics of the active compound and the particular therapeutic effect to be achieved, and 
the limitations inherent in the art of compounding such an active compound for the treatment of 
individuals. 

5 The nucleic acid molecules of the invention can be inserted into vectors and used as gene 

therapy vectors. Gene therapy vectors can be delivered to a subject by, for example, intravenous 
injection, local administration (see, e.g. t U.S. Patent No. 5,328,470) or by stereotactic injection 
(see, e.g. 9 Chen, et al> 1994. Proc. Natl Acad. Sci. USA 91: 3054-3057). The pharmaceutical 
preparation of the gene therapy vector can include the gene therapy vector in an acceptable 

10 diluent, or can comprise a slow release matrix in which the gene delivery vehicle is imbedded. 
Alternatively, where the complete gene delivery vector can be produced intact from recombinant 
cells, e.g., retroviral vectors, the pharmaceutical preparation can include one or more cells that 
produce the gene delivery system. 

The pharmaceutical compositions can be included in a container, pack, or dispenser 

1 5 together with instructions for administration. 

Screening and Detection Methods 

The isolated nucleic acid molecules of the invention can be used to express MOLX 
protein (e.g. , via a recombinant expression vector in a host cell in gene therapy applications), to 
detect MOLX mRNA (e.g., in a biological sample) or a genetic lesion in an MOLX gene, and to 

20 modulate MOLX activity, as described further, below. In addition, the MOLX proteins can be 
used to screen drugs or compounds that modulate the MOLX protein activity or expression as 
well as to treat disorders characterized by insufficient or excessive production of MOLX protein 
or production of MOLX protein forms that have decreased or aberrant activity compared to 
MOLX wild-type protein (e.g.; diabetes (regulates insulin release); obesity (binds and transport 

25 lipids); metabolic disturbances associated with obesity, the metabolic syndrome X as well as 
anorexia and wasting disorders associated with chronic diseases and various cancers, and 
infectious disease(possesses anti-microbial activity) and the various dyslipidemias. In addition, 
the anti-MOLX antibodies of the invention can be used to detect and isolate MOLX proteins and 
modulate MOLX activity. In yet a further aspect, the invention can be used in methods to 

30 influence appetite, absorption of nutrients and the disposition of metabolic substrates in both a 
positive and negative fashion. 

The invention further pertains to novel agents identified by the screening assays 
described herein and uses thereof for treatments as described, supra. 
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Screening Assays 

The invention provides a method (also referred to herein as a "screening assay") for 
identifying modulators, i.e., candidate or test compounds or agents (e.g., peptides, 
5 peptidomimetics, small molecules or other drugs) that bind to MOLX proteins or have a 

stimulatory or inhibitory effect on, e.g., MOLX protein expression or MOLX protein activity. 
The invention also includes compounds identified in the screening assays described herein. 

In one embodiment, the invention provides assays for screening candidate or test 
compounds which bind to or modulate the activity of the membrane-bound form of an MOLX 

1 0 protein or polypeptide or biologically-active portion thereof. The test compounds of the 
invention can be obtained using any of the numerous approaches in combinatorial library 
methods known in the art, including: biological libraries; spatially addressable parallel solid 
phase or solution phase libraries; synthetic library methods requiring deconvolution; the 
"one-bead one-compound" library method; and synthetic library methods using affinity 

15 chromatography selection. The biological library approach is limited to peptide libraries, while 
the other four approaches are applicable to peptide, non-peptide oligomer or small molecule 
libraries of compounds. See, e.g., Lam, 1997 '. Anticancer Drug Design 12: 145. 

A "small molecule" as used herein, is meant to refer to a composition that has a 
molecular weight of less than about 5 kD and most preferably less than about 4 kD. Small 

20 molecules can be, e.g. , nucleic acids, peptides, polypeptides, peptidomimetics, carbohydrates, 
lipids or other organic or inorganic molecules. Libraries of chemical and/or biological mixtures, 
such as fungal, bacterial, or algal extracts, are known in the art and can be screened with any of 
the assays of the invention. 

Examples of methods for the synthesis of molecular libraries can be found in the art, for 

25 example in: DeWitt, et al 9 1993. Proc. Natl. Acad. Scl U.S.A. 90: 6909; Erb, et al 9 1994. Proc. 
Natl Acad. Sci. U.SA. 91: 11422; Zuckermann, et al 9 1994. J. Med. Chem. 37: 2678; Cho, etal> 
1993. Science 261: 1303; Carrell, et al 9 1994. Angew. Chem. Int. Ed. Engl. 33: 2059; Carell, et 
al 9 1994. Aiigew. Chem. Int. Ed. Engl 33: 2061; and Gallop, et al. 9 1994. J. Med. Chem. 37: 
1233. 

30 Libraries of compounds may be presented in solution (e.g. , Houghten, 1 992. 

Biotechniques 13: 412-421), or on beads (Lam, 1991. Nature 354: 82-84), on chips (Fodor, 1993. 
Nature 364: 555-556), bacteria (Ladner, U.S. Patent No. 5,223,409), spores (Ladner, U.S. Patent 
5,233,409), plasmids (Cull, et al 9 1992. Proc. Natl Acad. Scl USA 89: 1865-1869) or on phage 
(Scott and Smith, 1990. Science 249: 386-390; Devlin, 1990. Science 249: 404-406; Cwirla, et 
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a/., 1990. Proc. Natl. Acad. Sci. U.SA. 87: 6378-6382; Felici, 1991. /. Mol Biol 222: 301-310; 
Ladner, U.S. Patent No. 5,233,409.). 

In one embodiment, an assay is a cell-based assay in which a cell which expresses a 
membrane-bound form of MOLX protein, or a biologically-active portion thereof, on the cell 
5 surface is contacted with a test compound and the ability of the test compound to bind to an 
MOLX protein determined. The cell, for example, can of mammalian origin or a yeast cell. 
Determining the ability of the test compound to bind to the MOLX protein can be accomplished, 
for example, by coupling the test compound with a radioisotope or enzymatic label such that 
binding of the test compound to the MOLX protein or biologically-active portion thereof can be 

10 determined by detecting the labeled compound in a complex. For example, test compounds can 
be labeled with 125 1, 35 S, 14 C, or 3 H, either directly or indirectly, and the radioisotope detected by 
direct counting of radioemission or by scintillation counting. Alternatively, test compounds can 
be enzymatically-labeled with, for example, horseradish peroxidase, alkaline phosphatase, or 
luciferase, and the enzymatic label detected by determination of conversion of an appropriate 

1 5 substrate to product. In one embodiment, the assay comprises contacting a cell which expresses 
a membrane-bound form of MOLX protein, or a biologically-active portion thereof, on the cell 
surface with a known compound which binds MOLX to form an assay mixture, contacting the 
assay mixture with a test compound, and determining the ability of the test compound to interact 
with an MOLX protein, wherein determining the ability of the test compound to interact with an 

20 MOLX protein comprises determining the ability of the test compound to preferentially bind to 
MOLX protein ot a biologically-active portion thereof as compared to the known compound. 

hi another embodiment, an assay is a cell-based assay comprising contacting a cell 
expressing a membrane-bound form of MOLX protein, or a biologically-active portion thereof, . 
on the cell surface with a test compound and determining the ability of the test compound to 

25 modulate (e.g., stimulate or inhibit) the activity of the MOLX protein or biologically-active 

portion thereof. Determining the ability of the test compound to modulate the activity of MOLX 
or a biologically-active portion thereof can be accomplished, for example, by determining the 
ability of the MOLX protein to bind to or interact with an MOLX target molecule. As used 
herein, a "target molecule" is a molecule with which an MOLX protein binds or interacts in 

30 nature, for example, a molecule on the surface of a cell which expresses an MOLX interacting 
protein, a molecule on the surface of a second cell, a molecule in the extracellular milieu, a 
molecule associated with the internal surface of a cell membrane or a cytoplasmic molecule. An 
MOLX target molecule can be a non-MOLX molecule or an MOLX protein or polypeptide of 
the invention. In one embodiment, an MOLX target molecule is a component of a signal 
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transduction pathway that facilitates transduction of an extracellular signal (e.g. a signal 
generated by binding of a compound to a membrane-bound MOLX molecule) through the cell 
membrane and into the cell. The target, for example, can be a second intercellular protein that 
has catalytic activity or a protein that facilitates the association of downstream signaling 
5 molecules with MOLX. 

Determining the ability of the MOLX protein to bind to or interact with an MOLX target 
molecule can be accomplished by one of the methods described above for determining direct 
binding. In one embodiment, determining the ability of the MOLX protein to bind to or interact 
with an MOLX target molecule can be accomplished by determining the activity of the target 

10 molecule. For example, the activity of the target molecule can be determined by detecting 

induction of a cellular second messenger of the target (i.e. intracellular Ca 2+ , diacylglycerol, IP3, 
etc.), detecting catalytic/enzymatic activity of the target an appropriate substrate, detecting the 
induction of a reporter gene (comprising an MOLX-responsive regulatory element operatively 
linked to a nucleic acid encoding a detectable marker, e.g., luciferase), or detecting a cellular 

1 5 response, for example, cell survival, cellular differentiation, or cell proliferation. 

In yet another embodiment, an assay of the invention is a cell-free assay comprising 
contacting an MOLX protein or biologically-active portion thereof with a test compound and 
determining the ability of the test compound to bind to the MOLX protein or biologically-active 
portion thereof. Binding of the test compound to the MOLX protein can be determined either 

20 directly or indirectly as described above. In one such embodiment, the assay comprises 

contacting the MOLX protein or biologically-active portion thereof with a known compound 
which binds MOLX to form an assay mixture, contacting the assay mixture with a test 
compound, and determining the ability of the test compound to interact with an MOLX protein, 
wherein determining the ability of the test compound to interact with an MOLX protein 

25 comprises determining the ability of the test compound to preferentially bind to MOLX or 
biologically-active portion thereof as compared to the known compound. 

In still another embodiment, an assay is a cell-free assay comprising contacting MOLX 
protein or biologically-active portion thereof with a test compound and determining the ability of 
the test compound to modulate (eg. stimulate or inhibit) the activity of the MOLX protein or 

30 biologically-active portion thereof. Determining the ability of the test compound to modulate the 
activity of MOLX can be accomplished, for example, by detennining the ability of the MOLX 
protein to bind to an MOLX target molecule by one of the methods described above for 
detennining direct binding. In an alternative embodiment, determining the ability of the test 
compound to modulate the activity of MOLX protein can be accomplished by determining the 



122 



WO 01/81578 



PCT/US01/13578 



ability of the MOLX protein further modulate an MOLX target molecule. For example, the 
catalytic/enzymatic activity of the target molecule on an appropriate substrate can be determined 
as described, supra. 

In yet another embodiment, the cell-free assay comprises contacting the MOLX protein 
5 or biologically-active portion thereof with a known compound which binds MOLX protein to 
form an assay mixture, contacting the assay mixture with a test compound, and determining the 
ability of the test compound to interact with an MOLX protein, wherein determining the ability 
of the test compound to interact with an MOLX protein comprises determining the ability of the 
MOLX protein to preferentially bind to or modulate the activity of an MOLX target molecule. 

10 The cell-free assays of the invention are amenable to use of both the soluble form or the 

membrane-bound form of MOLX protein. In the case of cell-free assays comprising the 
membrane-bound form of MOLX protein, it may be desirable to utilize a solubilizing agent such 
that the membrane-bound form of MOLX protein is maintained in solution. Examples of such 
solubilizing agents include non-ionic detergents such as n-octylglucoside, n-dodecylglucoside, 

1 5 n-dodecylmaltoside, octanoyl-N-methylglucamide, decanoyl-N-methylglucamide, Triton® 
X-100, Triton® X-l 14, Thesit®, Isotridecypory(ethylene glycol ether) n , N-dodecyl- 
N,N-dimethyl-3 -ammonio- 1 -propane sulfonate, 3-(3-cholamidopropyl) dimethylamminiol- 
1 -propane sulfonate (CHAPS), or 3-(3-cholanndopropyl)d4methylarnrniniol-2-hydroxy- 
1-propane sulfonate (CHAPSO). 

20 In more than one embodiment of the above assay methods of the invention, it may be 

desirable to immobilize either MOLX protein or its target molecule to facilitate separation of 
complexed from uncomplexed forms of one or both of the proteins, as well as to accommodate 
automation of the assay. Binding of a test compound to MOLX protein, or interaction of MOLX 
protein with a target molecule in the presence and absence of a candidate compound, can be 

25 accomplished in any vessel suitable for containing the reactants. Examples of such vessels 

include microtiter plates, test tubes, and micro-centrifuge tubes. In one embodiment, a fusion 
protein can be provided that adds a domain that allows one or both of the proteins to be bound to 
a matrix. For example, GST-MOLX fusion proteins or GST-target fusion proteins can be 
adsorbed onto glutathione sepharose beads (Sigma Chemical, St. Louis, MO) or glutathione 

30 derivatized microtiter plates, that are then combined with the test compound or the test 

compound and either the non-adsorbed target protein or MOLX protein, and the mixture is 
incubated under conditions conducive to complex formation (e.g., at physiological conditions for 
salt and pH). Following incubation, the beads or microtiter plate wells are washed to remove 
any unbound components, the matrix immobilized in the case of beads, complex determined 
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either directly or indirectly, for example, as described, supra. Alternatively, the complexes can 
be dissociated from the matrix, and the level of MOLX protein binding or activity determined 
using standard techniques. 

Other techniques for immobilizing proteins on matrices can also be used in the screening 
5 assays of the invention. For example, either the MOLX protein or its target molecule can be 
immobilized utilizing conjugation of biotin and streptavidin. Biotinylated MOLX protein or 
target molecules can be prepared from biotin-NHS (N-hydroxy-succinimide) using techniques 
well-known within the art (e.g. , biotinylation kit, Pierce Chemicals, Rockford, 111.), and 
immobilized in the wells of streptavidin-coated 96 well plates (Pierce Chemical). Alternatively, 

1 0 antibodies reactive with MOLX protein or target molecules, but which do not interfere with 

binding of the MOLX protein to its target molecule, can be derivatized to the wells of the plate, 
and unbound target or MOLX protein trapped in the wells by antibody conjugation. Methods for 
detecting such complexes, in addition to those described above for the GST-immobilized 
complexes, include immunodetection of complexes using antibodies reactive with the MOLX 

1 5 protein or target molecule, as well as enzyme-linked assays that rely on detecting an enzymatic 
activity associated with the MOLX protein or target molecule. 

In another embodiment, modulators of MOLX protein expression are identified in a 
method wherein a cell is contacted with a candidate compound and the expression of MOLX 
mRNA or protein in the cell is determined. The level of expression of MOLX mRNA or protein 

20 in the presence of the candidate compound is compared to the level of expression of MOLX 

mRNA or protein in the absence of the candidate compound. The candidate compound can then 
be identified as a modulator of MOLX mRNA or protein expression based upon this comparison. 
For example, when expression of MOLX mRNA or protein is greater (i.e., statistically 
significantly greater) in the presence of the candidate compound than in its absence, the 

25 candidate compound is identified as a stimulator of MOLX mRNA or protein expression. 

Alternatively, when expression of MOLX mRNA or protein is less (statistically significantly 
less) in the presence of the candidate compound than in its absence, the candidate compound is 
identified as an inhibitor of MOLX mRNA or protein expression. The level of MOLX mRNA or 
protein expression in the cells can be determined by methods described herein for detecting 

30 MOLX mRNA or protein. 

In yet another aspect of the invention, the MOLX proteins can be used as "bait proteins" 
in a two-hybrid assay or three hybrid assay (see, e.g., U.S. Patent No. 5,283,317; Zervos, et aL 9 
\99l. Celll2\ 223-232; Madura, et al., 1993. J. Biol. Chem. 268: 12046-12054; Bartel, et aU 
1993. Biotechniques 14: 920-924; Iwabuchi, et al, 1993. Oncogene 8: 1693-1696; and Brent 
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WO 94/10300), to identify other proteins that bind to or interact with MOLX ("MOLX-binding 
proteins" or "MOLX-bp") and modulate MOLX activity. Such MOLX-binding proteins are also 
likely to be involved in the propagation of signals by the MOLX proteins as, for example, 
upstream or downstream elements of the MOLX pathway. 
5 The two-hybrid system is based on the modular nature of most transcription factors, 

which consist of separable DNA-binding and activation domains. Briefly, the assay utilizes two 
different DNA constructs. In one construct, the gene that codes for MOLX is fused to a gene 
encoding the DNA binding domain of a known transcription factor (eg., GAL-4). In the other 
construct, a DNA sequence, from a library of DNA sequences, that encodes an unidentified 

10 protein ("prey" or "sample") is fused to a gene that codes for the activation domain of the known 
transcription factor. If the "bait" and the "prey" proteins are able to interact, in vivo, forming an 
MOLX-dependent complex, the DNA-brnding and activation domains of the transcription factor 
are brought into close proximity. This proximity allows transcription of a reporter gene (eg. , 
LacZ) that is operably linked to a transcriptional regulatory site responsive to the transcription 

1 5 factor. Expression of the reporter gene can be detected and cell colonies containing the 

functional transcription factor can be isolated and used to obtain the cloned gene that encodes the 
protein which interacts with MOLX. 

The invention further pertains to novel agents identified by the aforementioned screening 
assays and uses thereof for treatments as described herein. 

20 Detection Assays 

Portions or fragments of the cDNA sequences identified herein (and the corresponding 
complete gene sequences) can be used in numerous ways as polynucleotide reagents. By way of 
example, and not of limitation, these sequences can be used to: (I) map their respective genes on 
a chromosome; and, thus, locate gene regions associated with genetic disease; (if) identify an 
25 individual from a minute biological sample (tissue typing); and (Hi) aid in forensic identification 
of a biological sample. Some of these applications are described in the subsections, below. 

Chromosome Mapping 

Once the sequence (or a portion of the sequence) of a gene has been isolated, this 
30 sequence can be used to map the location of the gene on a chromosome. This process is called 
chromosome mapping. Accordingly, portions or fragments of the MOLX sequences, SEQ ID 
NOSrl, 3, 5, 7, 9, 1 1, 13, 15, 17, 19, 21, 23, 25, 27, and 29, or fragments or derivatives thereof, 
can be used to map the location of the MOLX genes, respectively, on a chromosome. The 
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mapping of the MOLX sequences to chromosomes is an important first step in correlating these 
sequences with genes associated with disease. 

Briefly, MOLX genes can be mapped to chromosomes by preparing PCR primers 
(preferably 1 5-25 bp in length) from the MOLX sequences. Computer analysis of the MOLX, 
5 sequences can be used to rapidly select primers that do not span more than one exon in the 

genomic DNA, thus complicating the amplification process. These primers can then be used for 
PCR screening of somatic cell hybrids containing individual human chromosomes. Only those 
hybrids containing the human gene corresponding to the MOLX sequences will yield an 
amplified fragment 

10 Somatic cell hybrids are prepared by fusing somatic cells from different mammals (e.g., 

human and mouse cells). As hybrids of human and mouse cells grow and divide, they gradually 
lose human chromosomes in random order, but retain the mouse chromosomes. By using media 
in which mouse cells cannot grow, because they lack a particular enzyme, but in which human 
cells can, the one human chromosome that contains the gene encoding the needed enzyme will 

15 be retained. By using various media, panels of hybrid cell lines can be established. Each cell 
line in a panel contains either a single human chromosome or a small number of human 
chromosomes, and a full set of mouse chromosomes, allowing easy mapping of individual genes 
to specific human chromosomes. See, e.g„ DEustachio, et al 9 1983. Science 220: 919-924. 
Somatic cell hybrids containing only fragments of human chromosomes can also be produced by 

20 using human chromosomes with translocations and deletions. 

PCR mapping of somatic cell hybrids is a rapid procedure for assigning a particular 
sequence to a particular chromosome. Three or more sequences can be assigned per day using a 
single thermal cycler. Using the MOLX sequences to design oligonucleotide primers, sub- 
localization can be achieved with panels of fragments from specific chromosomes. 

25 Fluorescence in situ hybridization (FISH) of a DNA sequence to a metaphase 

chromosomal spread can further be used to provide a precise chromosomal location in one step. 
Chromosome spreads can be made using cells whose division has been blocked in metaphase by 
a chemical like colcemid that disrupts the mitotic spindle. Hie chromosomes can be treated 
briefly with trypsin, and then stained with Giemsa. A pattern of light and dark bands develops 

30 on each chromosome, so that the chromosomes can be identified individually. The FISH 
technique can be used with a DNA sequence as short as 500 or 600 bases. However, clones 
larger than 1,000 bases have a higher likelihood of binding to a unique chromosomal location 
with sufficient signal intensity for simple detection. Preferably 1,000 bases, and more preferably 
2,000 bases, will suffice to get good results at a reasonable amount of time. For a review of this 
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technique, see, Venna, et al, HUMAN CHROMOSOMES: A MANUAL OF BASIC TECHNIQUES 
(Pergamon Press, New Yoik 1988). 

Reagents for chromosome mapping can be used individually to mark a single 
chromosome or a single site on that chromosome, or panels of reagents can be used for marking 
5 multiple sites and/or multiple chromosomes. Reagents corresponding to noncoding regions of 
the genes actually are preferred for mapping purposes. Coding sequences are more likely to be 
conserved within gene families, thus increasing the chance of cross hybridizations during 
chromosomal mapping. 

Once a sequence has been mapped to a precise chromosomal location, the physical 

10 position of the sequence on the chromosome can be correlated with genetic map data. Such data 
are found, e.g., in McKusick, MENDELIAN INHERITANCE IN MAN, available on-line through Johns 
Hopkins University Welch Medical Library). The relationship between genes and disease, 
mapped to the same chromosomal region, can then be identified through linkage analysis 
(co-inheritance of physically adjacent genes), described in, e.g., Egeland, et al. } 1987. Nature, 

15 325: 783-787. 

Moreover, differences in the DNA sequences between individuals affected arid 
unaffected with a disease associated with the MOLX gene, can be determined. If a mutation is 
observed in some or all of the affected individuals but not in any unaffected individuals, then the 
mutation is likely to be the causative agent of the particular disease. Comparison of affected and 

20 unaffected individuals generally involves first looking for structural alterations in the 

chromosomes, such as deletions or translocations that are visible from chromosome spreads or 
detectable using PCR based on that DNA sequence. Ultimately, complete sequencing of genes 
from several individuals can be performed to confirm the presence of a mutation and to 
distinguish mutations from polymorphisms. 

25 

Tissue Typing 

The MOLX sequences of the invention can also be used to identify individuals from 
minute biological samples. In this technique, an individual's genomic DNA is digested with one 
or more restriction enzymes, and probed on a Southern blot to yield unique bands for 
30 identification. The sequences of the invention are useful as additional DNA markers for RFLP 
("restriction fragment length polymorphisms," described in U.S. Patent No. 5,272,057). 

Furthermore, the sequences of the invention can be used to provide an alternative 
technique that determines the actual base-by-base DNA sequence of selected portions of an 
individual's genome. Thus, the MOLX sequences described herein can be used to prepare two 
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PCR primers from the 5 f - and 3 -termini of the sequences. These primers can then be used to 
amplify an individual's DNA and subsequently sequence it 

Panels of corresponding DNA sequences from individuals, prepared in this manner, can 
provide unique individual identifications, as each individual will have a unique set of such DNA 
5 sequences due to allelic differences. The sequences of the invention can be used to obtain such 
identification sequences from individuals and from tissue. The MOLX sequences of the 
invention uniquely represent portions of the human genome. Allelic variation occurs to some 
degree in the coding regions of these sequences, and to a greater degree in the noncoding 
regions. It is estimated that allelic variation between individual humans occurs with a frequency 
10 of about once per each 500 bases. Much of the allelic variation is due to single nucleotide 
polymorphisms (SNPs), which include restriction fragment length polymorphisms (RFLPs). 

Each of the sequences described herein can, to some degree, be used as a standard against 
. which DNA from an individual can be compared for identification purposes. Because greater 
numbers of polymorphisms occur in the noncoding regions, fewer sequences are necessary to 
15 differentiate individuals. The noncoding sequences can comfortably provide positive individual 
identification with a panel of perhaps 10 to 1,000 primers that each yield a noncoding amplified 
sequence of 100 bases. If predicted coding sequences, such as those in SEQ ID NOS:l, 3, 5, 7, 
9, 11, 13, 15, 17, 19, 21,23, 25, 27, and 29 are used, a more appropriate number of primers for 
positive individual identification would be 500-2,000. 

20 

Predictive Medicine 

The invention also pertains to the field of predictive medicine in which diagnostic assays, 
prognostic assays, pharmacogenomics, and monitoring clinical trials are used for prognostic 
(predictive) purposes to thereby treat an individual prophylactically. Accordingly, one aspect of 

25 the invention relates to diagnostic assays for determining MOLX protein and/or nucleic acid 
expression as well as MOLX activity, in the context of a biological sample (e.g. 9 blood, serum, 
cells, tissue) to thereby determine whether an individual is afflicted with a disease or disorder, or 
is at risk of developing a disorder, associated with aberrant MOLX expression or activity. The 
disorders include metabolic disorders, diabetes, obesity, infectious disease, anorexia, cancer- 

30 associated cachexia, cancer, neurodegenerative disorders, Alzheimer's Disease, Parkinson's 
Disorder, immune disorders, and hematopoietic disorders, and the various dyslipidemias 5 
metabolic disturbances associated with obesity, the metabolic syndrome X and wasting 
disorders associated with chronic diseases and various cancers. The invention also provides for 
prognostic (or predictive) assays for determining whether an individual is at risk of developing a 
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disorder associated with MOLX protein, nucleic acid expression or activity. For example, 
mutations in an MOLX gene can be assayed in a biological sample. Such assays can be used for 
prognostic or predictive purpose to thereby prophylactically treat an individual prior to the onset 
of a disorder characterized by or associated with MOLX protein, nucleic acid expression, or 
5 biological activity. 

Another aspect of the invention provides methods for determining MOLX protein, 
nucleic acid expression or activity in an individual to thereby select appropriate therapeutic or 
prophylactic agents for that individual (referred to herein as "pharmacogenomics"). 
Pharmacogenomics allows for the selection of agents (e.g., drugs) for therapeutic or prophylactic 
1 0 treatment of an individual based on the genotype of the individual (e.g. , the genotype of the 
individual examined to determine the ability of the individual to respond to a particular agent.) 

Yet another aspect of the invention pertains to monitoring the influence of agents (e.g. 9 
drugs, compounds) on the expression or activity of MOLX in clinical trials. 

These and other agents are described in further detail in the following sections. 

• 15 

Diagnostic Assays 

An exemplary method for detecting the presence or absence of MOLX in a biological 
sample involves obtaining a biological sample from a test subject and contacting the biological 
sample with a compound or an agent capable of detecting MOLX protein or nucleic acid (e.g., 

20 mRNA, genomic DNA) that encodes MOLX protein such that the presence of MOLX is detected 
in the biological sample. An agent for detecting MOLX mRNA or genomic DNA is a labeled 
nucleic acid probe capable of hybridizing to MOLX mRNA or genomic DNA. The nucleic acid 
probe can be, for example, a full-length MOLX nucleic acid, such as the nucleic acid of SEQ ID 
NOS:l, 3, 5, 7, 9, 11, 13, 15, 17, 19, 21, 23, 25, 27, and 29, or a portion thereof, such as an 

25 oligonucleotide of at least 15, 30, 50, 100, 250 or 500 nucleotides in length and sufficient to 
specifically hybridize under stringent conditions to MOLX mRNA or genomic DNA. Other 
suitable probes for use in the diagnostic assays of the invention are described herein. 

An agent for detecting MOLX protein is an antibody capable of binding to MOLX 
protein, preferably an antibody with a detectable label. Antibodies can be polyclonal, or more 

30 preferably, monoclonal. An intact antibody, or a fragment thereof (e.g., Fab or F(ab')a) can be 
used. The term "labeled", with regard to the probe or antibody, is intended to encompass direct 
labeling of the probe or antibody by coupling (i.e., physically linking) a detectable substance to 
the probe or antibody, as well as indirect labeling of the probe or antibody by reactivity with 
another reagent that is directly labeled. Examples of indirect labeling include detection of a 
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primary antibody using a fluorescently-labeled secondary antibody and end-labeling of a DNA 
probe with biotin such that it can be detected with fluorescently-labeled streptavidin. Hie term 
"biological sample" is intended to include tissues, cells and biological fluids isolated from a 
subject, as well as tissues, cells and fluids present within a subject. That is, the detection method 
5 of the invention can be used to detect MOLX mRNA, protein, or genomic DNA in a biological 
sample in vitro as well as in vivo. For example, in vitro techniques for detection of MOLX 
mRNA include Northern hybridizations and in situ hybridizations. In vitro techniques for 
detection of MOLX protein include enzyme linked immunosorbent assays (ELISAs), Western 
blots, immunoprecipitations, and immunofluorescence. In vitro techniques for detection of 

10 MOLX genomic DNA include Southern hybridizations. Furthermore, in vivo techniques for 
detection of MOLX protein include introducing into a subject a labeled anti-MOLX antibody. 
For example, the antibody can be labeled with a radioactive marker whose presence and location 
in a subject can be detected by standard imaging techniques. 

In one embodiment, the biological sample contains protein molecules from the test 

1 5 subject. Alternatively, the biological sample can contain mRNA molecules from the test subject 
or genomic DNA molecules from the test subject. A preferred biological sample is a peripheral 
blood leukocyte sample isolated by conventional means from a subject. 

In another embodiment, the methods further involve obtaining a control biological 
sample from a control subject, contacting the control sample with a compound or agent capable 

20 of detecting MOLX protein, mRNA, or genomic DNA, such that the presence of MOLX protein, 
mRNA or genomic DNA is detected in the biological sample, and comparing the presence of 
MOLX protein, mRNA or genomic DNA in the control sample with the presence of MOLX 
protein, mRNA or genomic DNA in the test sample. 

The invention also encompasses kits for detecting the presence of MOLX in a biological 

25 sample. For example, the kit can comprise: a labeled compound or agent capable of detecting 

MOLX protein or mRNA in a biological sample; means for determining the amount of MOLX in 
the sample; and means for comparing the amount of MOLX in the sample with a standard. The 
compound or agent can be packaged in a suitable container. The kit can further comprise 
instructions for using the kit to detect MOLX protein or nucleic acid. 

30 

Prognostic Assays 

The diagnostic methods described herein can furthermore be utilized to identify subjects 
having or at risk of developing a disease or disorder associated with aberrant MOLX expression 
or activity. For example, the assays described herein, such as the preceding diagnostic assays or 
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the following assays, can be utilized to identify a subject having or at risk of developing a 
disorder associated with MOLX protein, nucleic acid expression or activity.- Alternatively, the 
prognostic assays can be utilized to identify a subject having or at risk for developing a disease 
or disorder. Thus, the invention provides a method for identifying a disease or disorder 
5 associated with aberrant MOLX expression or activity in which a test sample is obtained from a 
subject and MOLX protein or nucleic acid (e.g., mRNA, genomic DNA) is detected, wherein the 
presence of MOLX protein or nucleic acid is diagnostic for a subject having or at risk of 
developing a disease or disorder associated with aberrant MOLX expression or activity. As used 
herein, a "test sample" refers to a biological sample obtained from a subject of interest For 

10 example, a test sample can be a biological fluid (e.g. , serum), cell sample, or tissue. 

Furthermore, the prognostic assays described herein can be used to determine whether a 
subject can be administered an agent (e.g., an agonist, antagonist, peptidomimetic, protein, 
peptide, nucleic acid, small molecule, or other drug candidate) to treat a disease or disorder 
associated with aberrant MOLX expression or activity. For example, such methods can be used 

1 5 to determine whether a subject can be effectively treated with an agent for a disorder. Thus, the 
invention provides methods for determining whether a subject can be effectively treated with an 
agent for a disorder associated with aberrant MOLX expression or activity in which a test sample 
is obtained and MOLX protein or nucleic acid is detected (e.g., wherein the presence of MOLX 
protein or nucleic acid is diagnostic for a subject that can be administered the agent to treat a 

20 disorder associated with aberrant MOLX expression or activity). 

The methods of the invention can also be used to detect genetic lesions in an MOLX 
gene, thereby determining if a subject with the lesioned gene is at risk for a disorder 
characterized by aberrant cell proliferation and/or differentiation. In various embodiments, the 
methods include detecting, in a sample of cells from the subject, the presence or absence of a 

25 genetic lesion characterized by at least one of an alteration affecting the integrity of a gene 
encoding an MOLX-protein, or the misexnression of the MOLX gene. For example, such 
genetic lesions can be detected by ascertaining the existence of at least one of: (z) a deletion of 
one or more nucleotides from an MOLX gene; (ii) an addition of one or more nucleotides to an 
MOLX gene; (Hi) a substitution of one or more nucleotides of an MOLX gene, (zv) a 

30 chromosomal rearrangement of an MOLX gene; (v) an alteration in the level of a messenger 
RNA transcript of an MOLX gene, (vf) aberrant modification of an MOLX gene, such as of the 
methylation pattern of the genomic DNA, (vii) the presence of a non-wild-type splicing pattern 
of a messenger RNA transcript of an MOLX gene, (viii) a non-wild-type level of an MOLX 
protein, (ix) allelic loss of an MOLX gene, and (x) inappropriate post-translational modification 
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of an MOLX protein. As described herein, there are a large number of assay techniques known 
in the art which can be used for detecting lesions in an MOLX gene. A preferred biological 
sample is a peripheral blood leukocyte sample isolated by conventional means from a subject 
However, any biological sample containing nucleated cells may be used, including, for example, 
5 buccal mucosal cells. 

In certain embodiments, detection of the lesion involves the use of a probe/primer in a 
polymerase chain reaction (PCR) {see, e.g., U.S. Patent Nos. 4,683,195 and 4,683,202), such as 
anchor PCR or RACE PCR, or, alternatively, in a ligation chain reaction (LCR) (see, e.g. 9 
Landegran, et al., 1988. Science 241: 1077-1080; and Nakazawa, et al, 1994. Proc. Natl Acad. 

10 Set USA 91 : 360-364), the latter of which can be particularly useful for detecting point 

mutations in the MOLX-gene (see, Abravaya, et aL, 1995. Nucl. Acids Res. 23: 675-682). This 
method can include the steps of collecting a sample of cells from a patient, isolating nucleic acid 
(e.g-> genomic, mRNA or both) from the cells of the sample, contacting the nucleic acid sample 
with one or more primers that specifically hybridize to an MOLX gene under conditions such 

1 5 that hybridization and amplification of the MOLX gene (if present) occurs, and defecting the 
presence or absence of an amplification product, or detecting the size of the amplification 
product and comparing the length to a control sample. It is anticipated that PCR and/or LCR 
may be desirable to use as a preliminary amplification step in conjunction with any of the 
techniques used for detecting mutations described herein. 

20 Alternative amplification methods include: self sustained sequence replication (see, 

Guatelli, etal y 1990. Proc. Natl Acad. Set USA 87: 1874-1878), transcriptional amplification 
system (see, Kwoh, etaU 1989. Proc. Natl. Acad. ScL USA 86: 1173-1177); QP Replicase (see, 
Lizardi, et al, 1988. BioTechnology 6: 1197), or any other nucleic acid amplification method, 
followed by the detection of the amplified molecules using techniques well known to those of 

25 skill in the art. These detection schemes are especially usefiil for the detection of nucleic acid 
molecules if such molecules are present in very low numbers. 

In an alternative embodiment, mutations in an MOLX gene from a sample cell can be 
identified by alterations in restriction enzyme cleavage patterns. For example, sample and 
control DNA is isolated, amplified (optionally), digested with one or more restriction 

30 endomicleases, and fragment length sizes are determined by gel electrophoresis and compared. 
Differences in fragment length sizes between sample and control DNA indicates mutations in the 
sample DNA. Moreover, the use of sequence specific ribozymes (see, e.g., U.S. Patent No. 
5,493,531) can be used to score for the presence of specific mutations by development or loss of 
a ribozyme cleavage site. 
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In other embodiments, genetic mutations in MOLX can be identified by hybridizing a 
sample and control nucleic acids, e.g., DNA or RNA, to high-density arrays containing hundreds 
or thousands of oligonucleotides probes. See, e.g. f Cronin, et al, 1996. Human Mutation 7: 
244-255; Kozal, et al., 1996. Nat Med. 2: 753-759. For example, genetic mutations in MOLX 
5 can be identified in two dimensional arrays containing light-generated DNA probes as described 
in Cronin, et al., supra. Briefly, a first hybridization array of probes can be used to scan through 
long stretches of DNA in a sample and control to identify base changes between the sequences 
by making linear arrays of sequential overlapping probes. This step allows the identification of 
point mutations. This is followed by a second hybridization array that allows the 

10 characterization of specific mutations by using smaller, specialized probe arrays complementary 
to all variants or mutations detected. Each mutation array is composed of parallel probe sets, one 
complementary to the wild-type gene and the other complementary to the mutant gene. 

In yet another embodiment, any of a variety of sequencing reactions known in the art can 
be used to directly sequence the MOLX gene and detect mutations by comparing the sequence of 

1 5 the sample MOLX with the corresponding wild-type (control) sequence. Examples of 

sequencing reactions include those based on techniques developed by Maxim and Gilbert, 1977. 
Proc. Natl Acad. Sci. USA 74: 560 or Sanger, 1977. Proc. Natl Acad. Sci. USA 74: 5463. It is 
also contemplated that any of a variety of automated sequencing procedures can be utilized when 
performing the diagnostic assays (see, e.g., Naeve, et al, 1995. Biotechniques 19: 448), 

20 including sequencing by mass spectrometry (see, e.g., PCT International Publication No. WO 
94/16101; Cohen, et al, 1996. Adv. Chromatography 36: 127-162; and Griffin, et al, 1993. 
Appl Biochem. Biotechnol 38: 147-159). 

Other methods for detecting mutations in the MOLX gene include methods in which 
protection from cleavage agents is used to detect mismatched bases in RNA/RNA ot RNA/DNA 

25 heteroduplexes. See, e.g., Myers, et al, 1985. Science 230: 1242. In general, the art technique 
of "mismatch cleavage" starts by providing heteroduplexes of formed by hybridizing (labeled) 
RNA or DNA containing the wild-type MOLX sequence with potentially mutant RNA or DNA 
obtained from a tissue sample. The double-stranded duplexes are treated with an agent that 
cleaves single-stranded regions of the duplex such as which will exist due to basepair 

30 mismatches between the control and sample strands. For instance, RNA/DNA duplexes can be 
treated with RNase and DNA/DNA hybrids treated with S] nuclease to en2ymatically digesting 
the mismatched regions. In other embodiments, either DNA/DNA or RNA/DNA duplexes can 
be treated with hydroxylamine or osmium tetroxide and with piperidine in order to digest 
mismatched regions. After digestion of the mismatched regions, the resulting material is then 
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separated by size on denaturing polyacrylamide gels to determine the site of mutation. See, e.g., 
Cotton, et al, 1988. Proc. Natl. Acad. Set USA 85: 4397; Saleeba, et al, 1992. Methods 
Enzymol 217: 286-295. In an embodiment, the control DNA or RNA can be labeled for 
detection. 

5 In still another embodiment, the mismatch cleavage reaction employs one or more 

proteins that recognize mismatched base pairs in double-stranded DNA (so called "DNA 
mismatch repair" enzymes) in defined systems for detecting and mapping point mutations in 
MOLX cDNAs obtained from samples of cells. For example, the mutY enzyme of R. coli 
cleaves A at G/A mismatches and the thymidine DNA glycosylase from HeLa cells cleaves T at 

10 G/T mismatches. See, e.g., Hsu, et al., 1994. Carcinogenesis 15: 1657-1662. According to an 
exemplary embodiment, a probe based on an MOLX sequence, e.g., a wild-type MOLX 
sequence, is hybridized to a cDNA or other DNA product from a test cell(s). The duplex is 
treated with a DNA mismatch repair enzyme, and the cleavage products, if any, can be detected 
from electrophoresis protocols or the like. See, e.g., U.S. Patent No. 5,459,039. 

1 5 In other embodiments, alterations in electrophoretic mobility will be used to identify 

mutations in MOLX genes. For example, single strand conformation polymorphism (SSCP) 
may be used to detect differences in electrophoretic mobility between mutant and wild type 
nucleic acids. See, e.g, Orita, et al., 1989. Proc. Natl. Acad. Sci. USA: 86: 2766; Cotton, 1993. 
Mutat Res. 285: 125-144; Hayashi, 1992. Genet. Anal Tech. Appl 9: 73-79. Single-stranded 

20 DNA fragments of sample and control MOLX nucleic acids will be denatured and allowed to 
renature. The secondary structure of single-stranded nucleic acids varies according to sequence, 
the resulting alteration in electrophoretic mobility enables the detection of even a single base 
change. The DNA fragments may be labeled or detected with labeled probes. The sensitivity of 
the assay may be enhanced by using RNA (rather than DNA), in which the secondary structure is 

25 more sensitive to a change in sequence. In one embodiment, the subject method utilizes 
heteroduplex analysis to separate double stranded heteroduplex molecules on the basis of 
changes in electrophoretic mobility. See, e.g., Keen, etal, 1991. Trends Genet. 7: 5. 

In yet another embodiment, the movement of mutant or wild-type fragments in 
polyacrylamide gels containing a gradient of denaturant is assayed using denaturing gradient gel 

30 electrophoresis (DGGE). See, e.g., Myers, et al, 1985. Nature 313: 495. When DGGE is used 
as the method of analysis, DNA will be modified to insure that it does not completely denature, 
for example by adding a GC clamp of approximately 40 bp of high-melting GC-rich DNA by 
PCR. In a further embodiment, a temperature gradient is used in place of a denaturing gradient 
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to identify differences in the mobility of control and sample DNA. See, e.g, Rosenbaum and 
Reissner, 1987. Biophys. Chenu 265: 12753. 

Examples of other techniques for detecting point mutations include, but are not limited 
to, selective oligonucleotide hybridization, selective amplification, or selective primer extension. 
5 For example, oligonucleotide primers may be prepared in which the known mutation is placed 
centrally and then hybridized to target DNA under conditions that permit hybridization only if a 
perfect match is found. See, e.g, Saiki, et al. % 1986. Nature 324: 163; Saiki, et a/., 1989. Proc. 
Natl Acad. Sci. USA 86: 6230. Such allele specific oligonucleotides are hybridized to PCR 
amplified target DNA or a number of different mutations when the oligonucleotides are attached 

10 to the hybridizing membrane and hybridized with labeled target DNA. 

Alternatively, allele specific amplification technology that depends on selective PCR 
amplification may be used in conjunction with the instant invention. Oligonucleotides used as 
primers for specific amplification may carry the mutation of interest in the center of the molecule 
(so that amplification depends on differential hybridization; see, e.g., Gibbs, et a/., 1989. Nucl 

15 Acids Res. 17: 2437-2448) or at the extreme 3'-termrnus of one primer where, under appropriate 
conditions, mismatch can prevent, or reduce polymerase extension (see, e.g., Prossner, 1993. 
Tibteck 1 1: 238). In addition it may be desirable to introduce a novel restriction site in the 
region of the mutation to create cleavage-based detection. See, e.g., Gasparini, et al, 1992. Mol 
Cell Probes 6:1. It is anticipated that in certain embodiments amplification may also be 

20 performed using Taq ligase for amplification. See, e.g., Barany, 1991. Proc. Natl Acad. Set 

USA 88: 189. In such cases, ligation will occur only if there is a perfect match at the 3 f -terminus 
of the 5' sequence, making it possible to detect the presence of a known mutation at a specific 
site by looking for the presence or absence of amplification. 

The methods described herein may be performed, for example, by utilizing pre-packaged 

25 diagnostic kits comprising at least one probe nucleic acid or antibody reagent described herein, 
which may be conveniently used, e.g., in clinical settings to diagnose patients exhibiting 
symptoms or family history of a disease or illness involving an MOLX gene. 

Furthermore, any cell type or tissue, preferably peripheral blood leukocytes, in which 
MOLX is expressed may be utilized in the prognostic assays described herein. However, any 

30 biological sample cmtaining nucleated cells may be used, including, for example, buccal 
mucosal cells. 
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Pharmacogenomics 

Agents, or modulators that have a stimulatory or inhibitory effect on MOLX activity 
(e.g., MOLX gene expression), as identified by a screening assay described herein can be 
5 administered to individuals to treat (prophylacticalry or therapeutically) disorders (The disorders 
include metabolic disorders, diabetes, obesity, infectious disease, anorexia, cancer-associated 
cachexia, cancer, neurodegenerative disorders, Alzheimer's Disease, Parkinson's Disorder, 
immune disorders, and hematopoietic disorders, and the various dyslipidemias, metabolic 
disturbances associated with obesity, the metabolic syndrome X and wasting disorders 

10 associated with chronic diseases and various cancers.) In conjunction with such treatment, the 
pharmacogenomics (i.e., the study of the relationship between an individual's genotype and that 
individual's response to a foreign compound or drug) of the individual may be considered. 
Differences in metabolism of therapeutics can lead to severe toxicity or therapeutic failure by 
altering the relation between dose and blood concentration of the pharmacologically active drug. 

15 Thus, the pharmacogenomics of the individual permits the selection of effective agents (e.g., 
drugs) for prophylactic or therapeutic treatments based on a consideration of the individual's 
genotype. Such pharmacogenomics can further be used to determine appropriate dosages and 
therapeutic regimens. Accordingly, the activity of MOLX protein, expression of MOLX nucleic 
acid, or mutation content of MOLX genes in an individual can be determined to thereby select 

20 appropriate agent(s) for therapeutic or prophylactic treatment of the individual. 

Pharmacogenomics deals with clinically significant hereditary variations in the response 

to drugs due to altered drug disposition and abnormal action in affected persons. See e.g, 

i 

Eichelbaum, 1996. Clin. Exp. Pharmacol Physiol, 23: 983-985; Linder, 1997. Clin. Chem., 43: 
254-266. In general, two types of pharmacogenetic conditions can be differentiated. Genetic 

25 conditions transmitted as a single factor altering the way drugs act on the body (altered drug 

action) or genetic conditions transmitted as single factors altering the way the body acts on drugs 
(altered drug metabolism). These pharmacogenetic conditions can occur either as rare defects or 
as polymorphisms. For example, glucose-6-phosphate dehydrogenase (G6PD) deficiency is a 
common inherited enzymopathy in which the main clinical complication is hemolysis after 

30 ingestion of oxidant drugs (anti-malarials, sulfonamides, analgesics, nitrofurans) and 
consumption of fava beans. 

As an illustrative embodiment, the activity of drug metabolizing enzymes is a major 
determinant of both the intensity and duration of drug action. The discovery of genetic 
polymorphisms of drug metabolizing enzymes (e.g, N-acetyltransferase 2 (NAT 2) and 
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cytochrome P450 enzymes CYP2D6 and CYP2C19) has provided an explanation as to why 
some patients do not obtain the expected drug effects or show exaggerated drug response and 
serious toxicity after taking the standard and safe dose of a drug. These polymorphisms are 
expressed in two phenotypes in the population, the extensive metabolizer (EM) and poor 
metabolizer (PM). The prevalence of PM is different among different populations. For example, 
the gene coding for CYP2D6 is highly polymorphic and several mutations have been identified 
in PM, which all lead to the absence of functional CYP2D6. Poor metabolizers of CYP2D6 and 
CYP2C19 quite frequently experience exaggerated drug response and side effects when they 
receive standard doses. If a metabolite is the active therapeutic moiety, PM show no therapeutic 
response, as demonstrated for the analgesic effect of codeine mediated by its CYP2D6-formed 
metabolite morphine. At the other extreme are the so called ultra-rapid metabolizers who do not 
respond to standard doses. Recently, the molecular basis of ultra-rapid metabolism has been 
identified to be due to CYP2D6 gene amplification. 

Thus, the activity of MOLX protein, expression of MOLX nucleic acid, or mutation 
content of MOLX genes in an individual can be determined to thereby select appropriate agent(s) 
for therapeutic or prophylactic treatment of the individual. In addition, pharmacogenetic studies 
can be used to apply genotyping of polymorphic alleles encoding drug-metabolizing enzymes to 
the identification of an individual's drug responsiveness phenotype. This knowledge, when 
applied to dosing or drug selection, can avoid adverse reactions or therapeutic failure and thus 
enhance therapeutic or prophylactic efficiency when treating a subject with an MOLX 
modulator, such as a modulator identified by one of the exemplary screening assays described 
herein. 

Monitoring of Effects During Clinical Trials 

Monitoring the influence of agents (e.g., drugs, compounds) on the expression or activity 
of MOLX {e.g., the ability to modulate aberrant cell proliferation and/or differentiation) can be 
applied not only in basic drug screening, but also in clinical trials. For example, the 
effectiveness of an agent determined by a screening assay as described herein to increase MOLX 
gene expression, protein levels, or upregulate MOLX activity, can be monitored in clinical trails 
of subjects exhibiting decreased MOLX gene expression, protein levels, or downregulated 
MOLX activity. Alternatively, the effectiveness of an agent determined by a screening assay to 
decrease MOLX gene expression, protein levels, or downregulate MOLX activity, can be 
monitored in clinical trails of subjects exhibiting increased MOLX gene expression, protein 
levels, or upregulated MOLX activity. In such clinical trials, the expression or activity of 
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MOLX and, preferably, other genes that have been implicated in, for example, a cellular 
proliferation or immune disorder can be used as a "read out" or markers of the immune 
responsiveness of a particular cell. 

By way of example, and not of limitation, genes, including MOLX, that are modulated in 
5 cells by treatment with an agent (e.g., compound, drug or small molecule) that modulates MOLX 
activity (e.g., identified in a screening assay as described herein) can be identified. Thus, to 
study the effect of agents on cellular proliferation disorders, for example, in a clinical trial, cells 
can be isolated and RNA prepared and analyzed for the levels of expression of MOLX and other 
genes implicated in the disorder. The levels of gene expression (z.e., a gene expression pattern) 

10 can be quantified by Northern blot analysis or RT-PCR, as described herein, or alternatively by 
measuring the amount of protein produced, by one of the methods as described herein, or by 
measuring the levels of activity of MOLX or other genes. In this manner, the gene expression 
pattern can serve as a marker, indicative of the physiological response of the cells to the agent 
Accordingly, this response state may be determined before, and at various points during, 

1 5 treatment of the individual with the agent. 

In one embodiment, the invention provides a method for monitoring the effectiveness of 
treatment of a subject with an agent (e.g. t an agonist, antagonist, protein, peptide, 
peptidomimetic, nucleic acid, small molecule, or other drug candidate identified by the screening 
assays described herein) comprising the steps of (i) obtaining a pre-a(todinistration sample from a 

20 subject prior to administration of the agent, (n) detecting the level of expression of an MOLX 
protein, mRNA, or genomic DNA in the preadrmnistration sample; (Hi) obtaining one or more 
post-administration samples from the subject; (zv) detecting the level of expression or activity of 
the MOLX protein, mRNA, or genomic DNA in the post-administration samples; (v) comparing 
the level of expression or activity of the MOLX protein, mRNA, or genomic DNA in the 

25 pre-administration sample with the MOLX protein, mRNA, or genomic DNA in the post 

administration sample or samples; and (w) altering the administration of the agent to the subject 
accordingly. For example, increased administration of the agent may be desirable to increase the 
expression or activity of MOLX to higher levels than detected, i.e., to increase the effectiveness 
of the agent. Alternatively, decreased administration of the agent may be desirable to decrease 

30 expression or activity of MOLX to lower levels than detected, i.e., to decrease the effectiveness 
of the agent. 
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Methods of Treatment 

The invention provides for both prophylactic and therapeutic methods of treating a 
subject at risk of (or susceptible to) a disorder or having a disorder associated with aberrant 
MOLX expression or activity. The disorders include cardiomyopathy, atherosclerosis, 
5 hypertension, congenital heart defects, aortic stenosis, atrial septal defect (ASD), 

atrioventricular (A-V) canal defect, ductus arteriosus, pulmonary stenosis, subaortic stenosis, 
ventricular septal defect (VSD), valve diseases, tuberous sclerosis, scleroderma, obesity, 
transplantation, adrenoleukodystrophy, congenital adrenal hyperplasia, prostate cancer, 
neoplasm; adenocarcinoma, lymphoma, uterus cancer, fertility, hemophilia, hypercoagulation, 
1 0 idiopathic thrombocytopenic purpura, immunodeficiencies, graft versus host disease, AIDS, 
bronchial asthma, Crohn's disease; multiple sclerosis, treatment of Albright Hereditary 
Ostoeodystrophy, and other diseases, disorders and conditions of the like. 
These methods of treatment will be discussed more fully, below. 

1 5 Disease and Disorders 

Diseases and disorders that are characterized by increased (relative to a subject not 
suffering from the disease or disorder) levels or biological activity may be treated with 
Therapeutics that antagonize (Le., reduce or inhibit) activity. Therapeutics that antagonize 
activity may be administered in a therapeutic or prophylactic manner. Therapeutics that may be 

20 utilized include, but are not limited to: (i) an aforementioned peptide, or analogs, derivatives, 
fragments or homologs thereof; (ii) antibodies to an aforementioned peptide; (Hi) nucleic acids 
encoding an aforementioned peptide; (iv) administration of antisense nucleic acid and nucleic 
acids that are "dysfunctional" (i.e., due to a heterologous insertion within the coding sequences 
of coding sequences to an aforementioned peptide) that are utilized to "knockout" endoggenous 

25 function of an aforementioned peptide by homologous recombination (see, e.g., Capecchi, 1989. 
Science 244: 1288-1292); or (v) modulators ( i.e., inhibitors, agonists and antagonists, including 
additional peptide mimetic of the invention or antibodies specific to a peptide of the invention) 
that alter the interaction between an aforementioned peptide and its binding partner. 

Diseases and disorders that are characterized by decreased (relative to a subject not 

30 suffering from the disease or disorder) levels or biological activity may be treated with 

Therapeutics that increase (Le. 9 are agonists to) activity. Therapeutics that upregulate activity 
may be administered in a therapeutic or prophylactic manner. Therapeutics that may be utilized 
include, but are not limited to, an aforementioned peptide, or analogs, derivatives, fragments or 
homologs thereof, or an agonist that increases bioavailability. 
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Increased or decreased levels can be readily detected by quantifying peptide and/or RNA, 
by obtaining a patient tissue sample {e.g., from biopsy tissue) and assaying it in vitro for RNA or 
peptide levels, structure and/or activity of the expressed peptides (or mRNAs of an 
aforementioned peptide). Methods that are well-known within the art include, but are not limited 
5 to, immunoassays {e.g., by Western blot analysis, irnmunoprecipitation followed by sodium 
dodecyl sulfate (SDS) polyacrylamide gel electrophoresis, immunocytochemistry, etc.) and/or 
hybridization assays to detect expression of mRNAs (e.g., Northern assays, dot blots, in situ 
hybridization, and the like). 

1 0 Prophylactic Methods 

In one aspect, the invention provides a method for preventing, in a subject, a disease or 
condition associated with an aberrant MOLX expression or activity, by administering to the 
subject an agent that modulates MOLX expression or at least one MOLX activity. Subjects at 
risk for a disease that is caused or contributed to by aberrant MOLX expression or activity can be 

15 identified by, for example, any or a combination of diagnostic or prognostic assays as described 
herein. Administration of a prophylactic agent can occur prior to the manifestation of symptoms 
characteristic of the MOLX aberrancy, such that a disease or disorder is prevented or, 
alternatively, delayed in its progression. Depending upon the type of MOLX aberrancy, for 
example, an MOLX agonist or MOLX antagonist agent can be used for treating the subject The 

20 appropriate agent can be determined based on screening assays described herein. The 
prophylactic methods of the invention are further discussed in the following subsections. 

Therapeutic Methods 

Another aspect of the invention pertains to methods of modulating MOLX expression or 
25 activity for therapeutic purposes. The modulatory method of the invention involves contacting a 
cell with an agent that modulates one or more of the activities of MOLX protein activity 
associated with the cell. An agent that modulates MOLX protein activity can be an agent as 
described herein, such as a nucleic acid or a protein, a naturally-occurring cognate ligand of an 
MOLX protein, a peptide, an MOLX peptidomimetic, or other small molecule. In one 
30 embodiment, the agent stimulates one or more MOLX protein activity. Examples of such 

stimulatory agents include active MOLX protein and a nucleic acid molecule encoding MOLX 
that has been introduced into the cell. In another embodiment, the agent inhibits one or more 
MOLX protein activity. Examples of such inhibitory agents include antisense MOLX nucleic 
acid molecules and anti-MOLX antibodies. These modulatory methods can be performed in 
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vitro (e.g., by culturing the cell with the agent) or, alternatively, in vivo (e.g., by administering 
the agent to a subject). As such, the invention provides methods of treating an individual 
afflicted with a disease or disorder characterized by aberrant expression or activity of an MOLX 
protein or nucleic acid molecule. In one embodiment, the method involves adniinistering an 
5 agent (e.g. , an agent identified by a screening assay described herein), or combination of agents 
that modulates (e.g., up-regulates or down-regulates) MOLX expression or activity. In another 
embodiment, the method involves adniinistering an MOLX protein or nucleic acid molecule as 
therapy to compensate for reduced or aberrant MOLX expression or activity. 

Stimulation of MOLX activity is desirable in situations in which MOLX is abnormally 
10 downregulated and/or in which increased MOLX activity is likely to have a beneficial effect. 
One example of such a situation is where a subject has a disorder characterized by aberrant cell 
proliferation and/or differentiation (e.g., cancer or immune associated disorders). Another 
example of such a situation is where the subject has a gestational disease (e.g. , preclampsia). 

Determination of the Biological Effect of the Therapeutic 

15 In various embodiments of the invention, suitable in vitro or in vivo assays are performed 

to determine the effect of a specific Therapeutic and whether its administration is indicated for 
treatment of the affected tissue. 

In various specific embodiments, in vitro assays may be performed with representative 
cells of the type(s) involved in the patient's disorder, to determine if a given Therapeutic exerts 

20 the desired effect upon the cell type(s). Compounds for use in therapy may be tested in suitable 
animal model systems including, but not limited to rats, mice, chicken, cows, monkeys, rabbits, 
and the like, prior to testing in human subjects. Similarly, for in vivo testing, any of the animal 
model system known in the art may be used prior to administration to human subjects. 

Prophylactic and Therapeutic Uses of the Compositions of the Invention 

25 The MOLX nucleic acids and proteins of the invention are useM in potential 

prophylactic and therapeutic applications implicated in a variety of disorders including, but not 
limited to: metabolic disorders, diabetes, obesity, infectious disease, anorexia, cancer-associated 
cancer, neurodegenerative disorders, Alzheimer's Disease, Parkinson's Disorder, immune 
disorders, hematopoietic disorders, and the various dyslipidemias, metabolic disturbances 

30 associated with obesity, the metabolic syndrome X and wasting disorders associated with 

chronic diseases and various cancers. 

As an example, a cDNA encoding the MOLX protein of the invention may be useful in 

gene therapy, and the protein may be useful when administered to a subject in need thereof. By 
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way of non-limiting example, the compositions of the invention will have efficacy for treatment 
of patients suffering from: metabolic disorders, diabetes, obesity, infectious disease, anorexia, 
cancer-associated cachexia, cancer, neurodegenerative disorders, Alzheimer's Disease, 
Parkinson's Disorder, immune disorders, hematopoietic disorders, and the various dyslipidemias. 
5 Both the novel nucleic acid encoding the MOLX protein, and the MOLX protein of the 

invention, or fragments thereof, may also be useful in diagnostic applications, wherein the 
presence or amount of the nucleic acid or the protein are to be assessed. A further use could be 
as an anti-bacterial molecule {i.e., some peptides have been found to possess anti-bacterial 
properties). These materials are further useful in the generation of antibodies which 
10 immunospecifically-bind to the novel substances of the invention for use in therapeutic or 
diagnostic methods. 

Examples 

Example 1: Quantitative expression analysis of clones in various cells and tissues 

The quantitative expression of various clones was assessed using microtiter plates 

15 containing RNA samples from a variety of normal and pathology-derived cells, cell lines and 
tissues using real time quantitative PCR (RTQ PCR; TAQMAN*). RTQ PCR was performed 
on a Perkin-Elmer Biosystems ABI PRISM® 7700 Sequence Detection System. Various 
coflections of samples are assembled on the plates, and referred to as Panel 1 (containing cells 
and cell lines from normal and cancer sources), Panel 2 (containing samples derived from 

20 tissues, in particular from surgical samples, from normal and cancer sources), Panel 3 

(containing samples derived from a wide variety of cancer sources) and Panel 4 (containing cells 
and cell lines from normal cells and cells related to inflammatory conditions). 

First, the RNA samples were normalized to constitutively expressed genes such as P - 
actin and GAPDH. RNA (~50 ng total or ~1 ng poIyA+) was converted to cDNA using the 

25 TAQMAN® Reverse Transcription Reagents Kit (PE Biosystems, FosteT City, CA; Catalog No. 
N808-0234) and random hexamers according to the manufacturer's protocol. Reactions were 
performed in 20 ul and incubated for 30 min. at 48°C. cDNA (5 ul) was then transferred to a 
separate plate for the TAQMAN® reaction using (3-actin and GAPDH TAQMAN® Assay 
Reagents (PE Biosystems; Catalog Nos. 4310881E and 4310884E, respectively) and 

30 TAQMAN® universal PCR Master Mix (PE Biosystems; Catalog No. 4304447) according to the 
manufacturer's protocol. Reactions were performed in 25 ul using the following parameters: 2 
min. at 50°C; 10 min. at 95°C; 15 sec. at 95°C/1 min. at 60°C (40 cycles). Results were recorded 
as CT values (cycle at which a given sample crosses a threshold level of fluorescence) using a 
log scale, with the difference in RNA concentration between a given sample and the sample with 
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the lowest CT value being represented as 2 to the power of delta CT. The percent relative 
expression is then obtained by taking the reciprocal of this RNA difference and multiplying by 
100. The average CT values obtained for B-actin and GAPDH were used to normalize RNA 
samples. The RNA sample generating the highest CT value required no further diluting, while 
5 all other samples were diluted relative to this sample according to their 3 -actin /GAPDH 
average CT values. 

Normalized RNA (5 ul) was converted to cDNA and analyzed via TAQMAN® using 
One Step RT-PCR Master Mix Reagents (PE Biosystems; Catalog No. 4309169) and gene- 
specific primers according to the manufacturer's instructions. Probes and primers were designed 

10 for each assay according to Perkin Elmer Biosystem's Primer Express Software package 

(version I for Apple Computer's Macintosh Power PC) or a similar algorithm using the target 
sequence as input Default settings were used for reaction conditions and the following 
parameters were set before selecting primers: primer concentration = 250 nM, primer melting 
temperature (T m ) range = 58°-60° C, primer optimal Tm = 59° C, maximum primer difference = 

15 2° C, probe does not have 5' G, probe T m must be 10° C greater than primer T m , amplicon size 75 
bp to 100 bp. The probes and primers selected (see below) were synthesized by Synthegen 
(Houston, TX, USA). Probes were double purified by HPLC to remove uncoupled dye and 
evaluated by mass spectroscopy to verify coupling of reporter and quencher dyes to the 5' and 3' 
ends of the probe, respectively. Their final concentrations were: forward and reverse primers, 

20 900 nM each, and probe, 200nM. 

PCR conditions: Normalized RNA from each tissue and each cell line was spotted in 
each well of a 96 well PCR plate (Perkin Elmer Biosystems). PCR cocktails including two 
probes (a probe specific for the target clone and another gene-specific probe multiplexed with the 
target probe) were set up using IX TaqMan™ PCR Master Mix for the PE Biosystems 7700, 

25 with5rrMMgC12,dNTTs^ 

Biosystems), and 0.4 U/pl RNase inhibitor, and 0.25 U/^il reverse transcriptase. Reverse 
transcription was performed at 48° C for 30 minutes followed by amplification/PCR cycles as 
follows: 95° C 10 min, then 40 cycles of 95° C for 15 seconds, 60° C for 1 minute. 

30 

In the results for Panel 1, the following abbreviations are used: 

ca. = carcinoma, 

* = established from metastasis, 

met = metastasis, 
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s cell var= small cell variant, 
non-s = non-sm =non-small, 
squam = squamous, 
pi. eff = pi effusion = pleural effusion, 
5 glio = glioma, 

astro = astrocytoma, and 
neuro = neuroblastoma. 

10 Panel 2 



The plates for Panel 2 generally include 2 control wells and 94 test samples composed of 
RNA or cDNA isolated from human tissue procured by surgeons working in close cooperation 
with the National Cancer Institute's Cooperative Human Tissue Network (CHTN) or the 

1 5 National Disease Research Initiative (NDRI). The tissues are derived from human malignancies 
and in cases where indicated many malignant tissues have "matched margins" obtained from 
noncancerous tissue just adjacent to the tumor. These are termed normal adjacent tissues and are 
denoted "NAT 5 in the results below. The tumor tissue and the "matched margins" are evaluated 
by two independent pathologists (the surgical pathologists and again by a pathologists at NDRI 

20 or CHTN). This analysis provides a gross histopathological assessment of tumor differentiation 
grade. Moreover, most samples include the original surgical pathology report that provides 
information regarding the clinical stage of the patient. These matched margins are taken from the 
tissue surrounding (i.e. immediately proximal) to the zone of surgery (designated "NAT", for 
normal adjacent tissue, in Table RR). In addition, RNA and cDNA samples were obtained from 

25 various human tissues derived from autopsies performed on elderly people or sudden death 

victims (accidents, etc.); These tissue were ascertained to be free of disease and were purchased 
from various commercial sources such as Clontech (Palo Alto, CA), Research Genetics, and 
Invitrogen. 

RNA integrity from all samples is controlled for quality by visual assessment of agarose. 
30 gel electropherograms using 28S and 1 8S ribosomal RNA staining intensity ratio as a guide (2: 1 
to 2.5:1 28s:18s) and the absence of low molecular weight RNAs that would be indicative of 
degradation products. Samples are controlled against genomic DNA contamination by RTQ 
PCR reactions run in the absence of reverse transcriptase using probe and primer sets designed to 
amplify across the span of a single exon. 

35 
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Panel 4 

Panel 4 includes samples on a 96 well plate (2 control wells, 94 test samples) composed 
of RNA (Panel 4r) or cDNA (Panel 4d) isolated from various human cell lines or tissues related 
5 to inflammatory conditions. Total RNA from control normal tissues such as colon and lung 

(Stratagene ,La Jolla, CA) and thymus and kidney (Clontech) were employed. Total RNA from 
liver tissue from cirrhosis patients and kidney from lupus patients was obtained from BioChain 
(Biochain Institute, Inc., Hayward, CA). Intestinal tissue for RNA preparation from patients 
diagnosed as having Crohn's disease and ulcerative colitis was obtained from the National 
10 Disease Research Interchange (NDRI) (Philadelphia, PA). 

Astrocytes, lung fibroblasts, dermal fibroblasts, coronary artery smooth muscle cells, 
small airway epithelium, bronchial epithelium, microvascular dermal endothelial cells, 
microvascular lung endothelial cells, human pulmonary aortic endothelial cells, human umbilical 
vein endothelial cells were all purchased from Clonetics (Walkersville, MD) and grown in the 

15 media supplied for these cell types by Clonetics. These primary cell types were activated with 
various cytokines or combinations of cytokines for 6 and/or 12-14 hours, as indicated. The 
following cytokines were used; IL-1 beta at approximately 1-5 ng/ml, TNF alpha at 
approximately 5-10 ng/ml, IFN gamma at approximately 20-50 ng/ml, IL-4 at approximately 5- 
10 ng/ml, IL-9 at approximately 5-10 ng/ml, IL-13 at approximately 5-10 ng/ml. Endothelial 

20 cells were sometimes starved for various times by culture in the basal media from Clonetics with 
0.1% serum. 

Mononuclear cells were prepared from blood of employees at CuraGen Corporation, 
using Ficoll. LAK cells were prepared from these cells by culture in DMEM 5% FCS 
(Hyclone), 100 uM non essential amino acids (Gibco/Life Technologies, Rockville, MD), 1 mM 

25 sodium pyruvate (Gibco), mercaptoethanol 5.5 x 10" 5 M (Gibco), and 10 mM Hepes (Gibco) and 
Interleukin 2 for 4-6 days. Cells were then either activated with 10-20 ng/ml PMA and 1-2 
fig/ml ionomycin, IL-12 at 5-10 ng/ml, IFN gamma at 20-50 ng/ml and IL-18 at 5-10 ng/ml for 6 
hours. In some cases, mononuclear cells were cultured for 4-5 days in DMEM 5% FCS 
(Hyclone), 100 uM non essential amino acids (Gibco), 1 mM sodium pyruvate (Gibco), 

30 mercaptoethanol 5.5 x 10' 5 M (Gibco), and 10 mM Hepes (Gibco) with PHA 

(phytqhemagglutinin) or PWM (pokeweed mitogen) at approximately 5 ug/ml. Samples were 
taken at 24, 48 and 72 hours for RNA preparation. MLR (mixed lymphocyte reaction) samples 
were obtained by taking blood from two donors, isolating the mononuclear cells using Ficoll and 
mixing the isolated mononuclear cells 1:1 at a final concentration of approximately 2x1 0 6 
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cellsAnl in DMEM 5% FCS (Hyclone), 100 fiM non essential amino acids (Gibco), 1 mM 
sodium pyruvate (Gibco), mercaptoethanol (5.5 x 10" 5 M) (Gibco), and 10 mM Hepes (Gibco). 
The MLR was cultured and samples taken at various time points ranging from 1-7 days for RNA 
preparation. 

5 Monocytes were isolated from mononuclear cells using CD 14 Miltenyi Beads, +ve VS 

selection columns and a Vario Magnet according to the manufacturer's instructions. Monocytes 
were differentiated into dendritic cells by culture in DMEM 5% fetal calf serum (FCS) 
(Hyclone, Logan, UT), 100 uM non essential amino acids (Gibco), 1 mM sodium pyruvate 
(Gibco), mercaptoethanol 5.5 x 10" 5 M (Gibco), and 10 mM Hepes (Gibco), 50 ng/ml GMCSF 

1 0 and 5 ng/ml JL-4 for 5-7 days. Macrophages were prepared by culture of monocytes for 5-7 days 
in DMEM 5% FCS (Hyclone), 100 pM non essential amino acids (Gibco), 1 mM sodium 
pyruvate (Gibco), mercaptoethanol 5.5 x 10" 5 M (Gibco), 10 mM Hepes (Gibco) and 10% AB 
Human Serum or MCSF at approximately 50 ng/ml. Monocytes, macrophages and dendritic cells 
were stimulated for 6 and 12-14 hours with lipopolysaccharide (LPS) at 100 ng/ml. Dendritic 

1 5 cells were also stimulated with anti-CD40 monoclonal antibody (Pharmingen) at 10 jag/ml for 6 
and 12-14 hours. 

CD4 lymphocytes, CD8 lymphocytes and NK cells were also isolated from mononuclear 
cells using CD4, CD8 and CD56 Miltenyi beads, positive VS selection columns and a Vario 
Magnet according to the manufacturer's instructions. CD45RA and CD45RO CD4 lymphocytes 

20 were isolated by depleting mononuclear cells of CD8, CD56, CD14 and CD1 9 cells using CD8, 
CD56, CD14 and CD19 Miltenyi beads and +ve selection. Then CD45RO beads were used to 
isolate the CD45RO CD4 lymphocytes with the remaining cells being CD45RA CD4 
lymphocytes. CD45RA CD4, CD45RO CD4 and CD8 lymphocytes were placed in DMEM 5% 
FCS (Hyclone), 100 pM non essential amino acids (Gibco), 1 mM sodium pyruvate (Gibco), 

25 mercaptoethanol 5.5 x 10" 5 M (Gibco), and 10 mM Hepes (Gibco) and plated at 10 6 cells/ml onto 
Falcon 6 well tissue culture plates that had been coated overnight with 0.5 pg/ml anti-CD28 
(Pharmingen) and 3 ug/ml anti-CD3 (OKT3, ATCC) in PBS. After 6 and 24 hours, the cells 
were harvested for RNA preparation. To prepare chronically activated CD8 lymphocytes, we 
activated the isolated CD8 lymphocytes for 4 days on anti-CD28 and anti-CD3 coated plates and 

30 then harvested the cells and expanded them in DMEM 5% FCS (Hyclone), 1 00 pM non essential 
amino acids (Gibco), 1 mM sodium pyruvate (Gibco), mercaptoethanol 5.5 x 10" 5 M (Gibco), 
and 10 mM Hepes (Gibco) and EL-2. The expanded CD8 cells were then activated again with 
plate bound anti-CD3 and anti-CD28 for 4 days and expanded as before. RNA was isolated 6 
and 24 hours after the second activation and after 4 days of the second expansion culture. The 
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isolated NK cells were cultured in DMEM 5% FCS (Hyclone), 100 pM non essential amino 
acids (Gibco), 1 mM sodium pyruvate (Gibco), mercaptoetbanol 5.5 x 10~ 5 M (Gibco), and 10 
mM Hepes (Gibco) and IL-2 for 4-6 days before RNA was prepared. 

To obtain B cells, tonsils were procured from NDRL The tonsil was cut up with sterile 
5 dissecting scissors and then passed through a sieve. Tonsil cells were then spun down and 
resupended at 10 6 cells/ml in DMEM 5% FCS (Hyclone), 100 jiM non essential amino acids 
(Gibco), 1 mM sodium pyruvate (Gibco), mercaptoethanol 5.5 x 10" 5 M (Gibco), and 10 mM 
Hepes (Gibco). To activate the cells, we used PWM at 5 ug/ml or anti-CD40 (Pharmingen) at 
approximately 10 fig/ml and IL-4 at 5-10 ng/ml. Cells were harvested for RNA preparation at 
10 24,48 and 72 hours. 

To prepare the primary and secondary Thl/Th2 and Trl cells, six-well Falcon plates were 
coated overnight with 10 ug/ml anti-CD28 (Pharmingen) and 2 ug/ml OKT3 (ATCC), and then 
washed twice with PBS. Umbilical cord blood CD4 lymphocytes (Poietic Systems, German 

5 6 

Town, MD) were cultured at 10 -10 cells/ml in DMEM 5% FCS (Hyclone), 100 pM non 
15 essential amino acids (Gibco), 1 mM sodium pyruvate (Gibco), mercaptoethanol 5.5 x 10' 5 M 
(Gibco), 10 mM Hepes (Gibco) and EL-2 (4 ng/ml). IL-12 (5 ng/ml) and anti-IL4 (1 Dg/ml) 
were used to direct to Thl , while IL-4 (5 ng/ml) and anti-EFN gamma (1 □ g/ml) were used to 
direct to Th2 and IL-10 at 5 ng/ml was used to direct to Trl . After 4-5 days, the activated Thl, 
Th2 and Trl lymphocytes were washed once in DMEM and expanded for 4-7 days in DMEM 
20 5% FCS (Hyclone), 100 \iM non essential amino acids (Gibco), 1 mM sodium pyruvate (Gibco), 
mercaptoethanol 5.5 x 10" 5 M (Gibco), 10 mM Hepes (Gibco) and IL-2 (1 ng/ml). Following 
this, the activated Thl , Th2 and Trl lymphocytes were re-stimulated for 5 days with anti- 
CD28/OKT3 and cytokines as described above, but with the addition of anti-CD95L (1 Dg/ml) 
to prevent apoptosis. After 4-5 days, the Thl, Thl and Trl lymphocytes were washed and then 
25 expanded again with 11^2 for 4-7 days. Activated Thl and Th2 lymphocytes were maintained in 
this way for a maximum of three cycles. RNA was prepared from primary and secondary Thl, 
Th2 and Trl after 6 and 24 hours following the second and third activations with plate bound 
anti-CD3 and anti-CD28 mAbs and 4 days into the second and third expansion cultures in 
Interleukin 2. 

30 The following leukocyte cells lines were obtained from the ATCC: Ramos, EOL-1, KU- 

812. EOL cells were further differentiated by culture in 0.1 mM dbcAMP at 5 xlO 5 cells/ml for 

8 days, changing the media every 3 days and adjusting the cell concentration to 5 xlO 5 cells/ml. 

For the culture of these cells, we used DMEM or RPMI (as recommended by the ATCC), with 

the addition of 5% FCS (Hyclone), 100 uM non essential amino acids (Gibco), 1 mM sodium 
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pyruvate (Gibco), mercaptoethanol 5.5 x 10" 5 M (Gibco), 10 mM Hepes (Gibco). RNA was 
either prepared from resting cells or cells activated with PMA at 10 ng/ml and ionomycin at 1 
lig/ml for 6 and 14 hours. Keratinocyte line CCD106 and an airway epithelial tumor line NCI- 
H292 were also obtained from the ATCC. Both were cultured in DMEM 5% FCS (Hyclone), 
5 100 non essential amino acids (Gibco), 1 mM sodium pyruvate (Gibco), mercaptoethanol 5.5 
x 10' 5 M (Gibco), and 10 mM Hepes (Gibco). CCD1 106 cells were activated for 6 and 14 hours 
with approximately 5 ng/ml TNF alpha and 1 ng/ml IL-1 beta, while NCI-H292 cells were 
activated for 6 and 14 hours with the following cytokines: 5 ng/ml IL-4, 5 ng/ml IL-9, 5 ng/ml 
IL-13 and 25 ng/ml IFN gamma. 

10 For these cell lines and blood cells, RNA was prepared by lysing approximately 10 7 

cells/ml using Trizol (Gibco BRL). Briefly, 1/10 volume of bromochloropropane (Molecular 
Research Corporation) was added to the RNA sample, vortexed and after 1 0 minutes at room 
temperature, the tubes were spun at 14,000 rpm in a Sorvall SS34 rotor. The aqueous phase was 
removed and placed in a 15 ml Falcon Tube. An equal volume of isopropanol was added and left 

15 at -20 degrees C overnight The precipitated RNA was spun down at 9,000 rpm for 15 min in a 
Sorvall SS34 rotor and washed in 70% ethanol. The pellet was redissolved in 300 pi of RNAse- 
free water and 35 \xl buffer (Promega) 5 ^1 DTT, 7 |d RNAsin and 8 |d DNAse were added. The 
tube was incubated at 37 degrees C for 30 minutes to remove contaminating genomic DNA, 
extracted once with phenol chloroform and re-precipitated with 1/10 volume of 3 M sodium 

20 acetate and 2 volumes of 1 00% ethanol. The RNA was spun down and placed in RNAse free 
water. RNA was stored at -80 degrees C. 

MOL1 

Expression of gene GMJ79960178 was assessed using the primer-probe set Agl605, 
described in Table 12. Results of the RTQ-PCR runs are shown in Tables 1 3 and 14. 

25 

Table 12. Probe Name: Agl605 



Primers 


Sequences 


Tm 


Length 


Start 
Position 


SEQID 
NO: 


Forward 


5'-CCTGAGCTACAACAACATCATG-3' 


58.3 


22 


333 


76 


Probe 


FAM-5'- 

CCTCATATCCCTGTCCCTCAGCCATA- 
3'-TAMRA 


69 


26 


378 


77 


Reverse 


5'-GCAGAGTCTAGCATCAGGATGT-3' 


58.6 


22 


407 


78 
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Table 13. Panels 1.3D and 2D 



PANEL 1.3D 




PANEL 2D 






Tissue Name 


Rel. Expr., 
% 

1.3dx4tm54 
02fagl605 
bl 


Tissue Name 


Rel. Expr., 
% 

2Dtm2728f 
agl605 " 


Rel. Expr., 
% 

2dx4tm4732 
f_agl605_a 
2 


Liver 

adenocarcinoma 


1.9 


Normal Colon GENPAK 
061003 


70.2 


70.3 j 


Pancreas 


0 


83219 CC Well to Mod Diff 
(OD03866) 


8.5 


6.3 


Pancreatic ca. 
CAP AN 2 


17.6 


83220 CC NAT (OD03866) 


O 1 

8.1 


A A 

9.4 


A J 1 1 J 

Adrenal gland 


1.3 


83221 CC Gr.2 rectosigmoid 
(OD03868) 


A C 

4.5 


12.1 


Thyroid 


5.6 


83222 CCNAT(OD03868) 


1.6 


6.3 


Salivary gland 


8.9 


83235 CC Mod DirT 
(ODO3920) 


1 *T A 

17.9 


19.7 


Pituitary gland 


1.7 


83236 CC NAT (ODO3920) 


0 1 c 

21.5 


1 1.0 


Brain (fetal) 


21.2 


83237 CC Gr.2 ascend colon 
(OD03921) 


18 


34.4 


Brain (whole) 


30.5 


83238 CC NAT (OD03921) 


8.9 


9.8 


Brain 
(amygdala) 


6.6 


83241 CC from Partial 
Hepatectomy (ODO4309) 


7.1 


4.9 


Brain 

(cerebellum) 


14.6 


83242 Liver NAT (ODO4309) 


17.1 


6.9 


Brain 

(hippocampus) 


8.7 


87472 Colon mets to lung 
(OD04451-01) 


6.7 


11.4 


Brain 

(substantia 

nigra) 


9.3 


87473 Lung NAT (OD04451- 
02) 


0.8 


2.9 


Brain 
(thalamus) 


4.8 


Normal Prostate Clontech A+ 
6546-1 


17.8 


52.9 


Cerebral Cortex 


2.6 


84140 Prostate Cancer 
(OD04410) 


18.6 


23 


Spinal cord 


15.7 


84141 Prostate NAT 
(OD04410) 


9.1 


22 


CNSca. 

(glio/astro) 

U87-MG 


0 


87073 Prostate Cancer 
(OD04720-01) 


20 


14.5 


CNS ca. 
(glio/astro) U- 
118-MG - 


1.5 


87074 Prostate NAT 
(OD04720-02) 


21.3 


38.1 


CNS ca. (astro) 
SW1783 


0 


Normal Lung GENPAK 
061010 


69.7 


85.9 
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CNS ca.* 
(neuro; met ) 
SK-N-AS 


1.6 


83239 Lung Met to Muscle 
(OD04286) 


11.1 


6.3 


CNS ca. (astro) 
SF-539 


0 


83240 Muscle NAT 
(OD04286) 


15.1 


23.3 


CNS ca. (astro) 
SNB-75 


0 


841 36 Lung Malignant Cancer 
(OD03126) 


30.6 


30.8 


CNS ca. (glio) 
SNB-19 


9.7 


841 37 Lung NAT (OD03 126) 


8 


16.3 


CNS ca. (glio) 
U251 


4.8 


84871 Lung Cancer 
(OD04404) 


41.8 


45.2 


CNS ca. (glio) 
SF-295 


7.7 


84872 Lung NAT (OD04404) 


15.4 


27.5 


Heart (fetal) 


0.9 


84875 Lung Cancer 
(OD04565) 


7.4 


8.5 


Heart 


6.1 


84876 Lung NAT (OD04565) 


5.3 


1.2 


Fetal Skeletal 


3.5 


85950 Lung Cancer 
fOD04237-0n 


77.9 


72.9 


Skeletal muscle 


100 


85970 Lung NAT (OD04237- 
02) 


14.3 


12.9 


Bone marrow 


22.8 


£3255 Ocular Mel Met to 
Liver (ODO4310) 


2.7 


6 


Thymus 


12.2 


83256 Liver NAT (ODO4310) 


2.3 


5.1 


Spleen 


14.6 


84 1 3 9 Melanoma Mets to 
Lung (OD04321) 


1.3 


6.2 


Lymph node 


57.9 


m\H Lung NAT (OD04321) 


36.9 


22.4 


Colorectal 


3.4 


Normal Kidney GENPAK 
061008 


9.2 


19.2 


Stomach 


20.7 


83786 Kidney Ca, Nuclear 
grade 2 (OD04338) 


4.2 


6.9 


Small intestine 


33.5 


83787 Kidney NAT 
(OD04338) 


7 


8.3 


Colon ca. 
SW480 


0.6 


83788 Kidney Ca Nuclear 
grade 1/2 (OD04339) 


18.7 


11.3 


Colon ca.* 

(SW480 

met)SW620 


0 


83789 Kidney NAT 
(OD04339) 


4.9 


8.5 


Colon ca. 
HT29 


0 


83790 Kidney Ca, Clear cell 
type (OD04340) 


6.3 . 


14.8 


Colon ca. 
HCT-116 


0 


83791 Kidney NAT 
(OD04340) 


10.7 


14.6 


Colon ca. 
CaCo-2 


2.2 


83792 Kidney Ca, Nuclear 
grade 3 (OD04348) 


6 


7.5 


83219 CCWeU 
to Mod Diff 
(OD03866) 


0 


83793 Kidney NAT 
(OD04348) 


14.5 


16.5 
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Colon ca. 
HCC-2998 


0 


87474 Kidney Cancer 
(OD04622-01) 


15.4 


25 


Gastric ca.* 
(liver met) NCI- 
N87 


7.2 


87475 Kidney NAT 
(OD04622-03) 


2.3 


2.8 


Bladder 


2.5 


85973 Kidney Cancer 
(OD04450-01) 


3.6 


7.3 


Trachea 


4.9 


85974 Kidney NAT 
(OD04450-03) 


1.3 


13.3 


Kidney 


0 


Kidney Cancer Clontech 
8120607 


7.3 


52 


Kidney (fetal) 


7 


Kidney NAT Clontech 
8120608 


1.1 


3.6 


Renal ca. 
786-0 


0 


Kidney Cancer Clontech 
8120613 


2.8 


7.5 


Renal ca. 
A498 


1.5 


Kidney NAT Clontech 
8120614 


3.7 


8 


Renal ca. 
RXF393 


3 


Kidney Cancer Clontech 
9010320 


17.4 


14.1 


Renal ca. 
ACHN 


1.1 


Kidney NAT Clontech 
9010321 


16.4 


11.3 


Renal ca. 
UO-31 


0 


Normal Uterus GENPAK 
061018 


4.6 


5.2 


Renal ca. 
TK-10 


1.2 


Uterus Cancer GENPAK 
064011 


17.2 


32.4 


Liver 


0.6 


Normal Thyroid Clontech A+ 
6570-1 


17.3 


18.1 


Liver (fetal) 


8.8 


Thyroid Cancer GENPAK 
064010 


6.5 


1.1 


Liver ca. 

(hepatoblast) 

HepG2 


24.6 


Thyroid Cancer 
INVrrROGEN A302152 


1.7 


1.6 


Lung 


1.3 


Thyroid NAT INVTTROGEN 
A302153 


9.2 


17.7 


Lung (fetal) 


15.8 


Normal Breast GENPAK 
061019 


21.3 


30.1 


Lung ca. (small 
cell) LX-1 


1.5 


84877 Breast Cancer 
(OD04566) 


1.4 


1.5 


Lung ca. (small 
cell) NCI-H69 


4.2 


85975 Breast Cancer 
(OD04590-01) 


12.9 


10.1 


Lung ca. (s.cell 
var.) SHP-77 


1.4 


85976 Breast Cancer Mets 
(OD04590-03) 


100 


88 


Lung ca. (large 
cell)Na-H460 


0 


87070 Breast Cancer 
Metastasis (OD04655-05) 


32.8 


0 


Lung ca. (non- 
sm. cell) A549 


2.9 


GENPAK Breast Cancer 
064006 


25 


28.5 


Lung ca. (non- 
s.cell)Na-H23 


9.5 


Breast Cancer Res. Gen. 1024 


51.8 


66.3 
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Lung ca (non- 
s.cell) HOP-62 


3.4 


Breast Cancer Clontech 
9100266 


25 


25.1 


Lung ca. (non- 
s.cl)Na-H522 


0 


Breast NAT Clontech 9100265 


18.2 


31.7 


Lung ca. 
(squam.) SW 
900 


4.7 


Breast Cancer INVITROGEN 
A209073 


15 


21 


Lung ca. 
(squam.) NCI- 
H596 


0 


Breast NAT INVITROGEN 
A2090734 


16.2 


12.3 


Mammary 
gland 


0 


Normal Liver GENPAK 
061009 


5.6 


4.9 


Breast ca.* (pi. 
effusion) MCF- 
7 


0.7 


Liver Cancer GENPAK 
064003 


5.4 


11.8 


Breast ca.* 
(pl.ef) MDA- 
MB-231 


6.1 


Liver Cancer Research 
Genetics RNA 1025 


3.7 


3.6 


Breast ca.* (pi. 

effusion) 

T47D 


5.1 


Liver Cancer Research 
Genetics RNA 1026 


4.8 


5.5 


Breast ca. 
BT-549 


2.8 


Paired Liver Cancer Tissue 
Research Genetics RNA 6004- 
T 


10.6 


7 


Breast ca. 
MDA-N 


0 


paired liver Tissue Research 
Genetics RNA 6004-N 


19.3 


28.4 


Ovary 


2.7 


Paired Liver Cancer Tissue 
Research Genetics RNA 6005- 
T 


4.6 


6 


Ovarian ca. 
OVCAR-3 


3.2 


Paired liver Tissue Research 
Genetics RNA 6005-N 


5 


0 


Ovarian ca. 
OVCAR-4 


3.7 


Normal Bladder GENPAK 
061001 


18.3 


11.5 


Ovarian ca. 
OVCAR-5 


2.5 


Bladder Cancer Research 
Genetics RNA 1023 


7.9 


21.4 


Ovarian ca. 
OVCAR-8 


1.7 


Bladder Cancer 
INVITROGEN A302173 


33 


18.9 


Ovarian f*a 

V-V V OJ. 1 CLL1 KftXt 

IGROV-1 


0 


87071 Til adder Cancer 
(OD04718-01) 


11 


18.5 


\jvaxiaxi \j<x. 

(ascites) SK- 
OV-3 


1 .o 


87077 Rl adder Normal 

Adjacent (OD047 18-03) 


9.4 


9.7 


Uterus 


24.6 • 


Normal Ovary Res. Gen. 


11.6 


11.3 


Placenta 


2.8 


Ovarian Cancer GENPAK 
064008 


23.3 


22.9 


Prostate 


2.5 


87492 Ovary Cancer 
(OD04768-07) 


80.1 


100 


Prostate ca.* 
(bone met)PC-3 


26.5 


87493 Ovary NAT 
(OD04768-08) 


3.8 


6.4 
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Testis 


0 


Normal Stomach GENPAK 
061017 


15.9 


17.1 


Melanoma 
Hs688(A).T 


0 


Gastric Cancer Clontech 
9060358 


36.3 


45.8 


Melanoma* 
(met) 

Hs688(B).T 


0 


NAT Stomach Clontech 
9060359 


44.4 


60.8 


Melanoma 
UACC-62 


1.2 


Gastric Cancer Clontech 
9060395 


13.8 


20.4 


Melanoma 
M14 


0 


NAT Stomach Clontech 
9060394 


78.5 


93 


Melanoma 
LOXIMVI 


0.6 


Gastric Cancer Clontech 
9060397 


5.8 


11.5 


Melanoma* 
(met) SK- 
MEL-5 


4.1 


NAT Stomach Clontech 
9060396 


25.5 


30.1 


Adipose 


2 


Gastric Cancer GENPAK 
064005 


35.1 


- 27.6 



Table 14. Panel 3D 



Tissue Name 


Rel. Expr., % 
3dtm5227f agl605 


94905_Daoy_Medulloblastoma/Cerebellum_sscDNA 


0.0 | 


94906_TE671JMedulloblastom/Cerebellum_sscDNA 


0.4 


94907jD283Med_Medulloblastoma/Cerebellum_sscDNA 


0.0 


94908_PFSK-l_Primitive 
Neuroectodermal/Cerebellum sscDNA 


0.0 


94909_XF-498_CNS_sscDNA 


0.0 


94910j3NB-78_CNS/glioma_sscDNA 


0.0 


9491 l_SF-268_CNS/glioblastoma_sscDNA 


0.0 


94912 T98G Glioblastoma sscDNA 


0.0 


96776_SK-N-SH_Neuroblastoma(metastasis)_sscDNA 


0.0 


94913_SF-295_CNS/glioblastoma_sscDNA 


0.2 


94914 Cerebellum sscDNA 


1.9 


96777_Cerebellum_sscDNA 


3.4 


94916_NCI-H292_Mucoepideimoid lung carcinoma_sscDNA 


0.4 


94917_DMS-1 14_Small cell lung cancer_sscDNA 


0.0 


94918_DMS-79_Small cell lung 
cancer/neuroendocrine sscDNA 


. 3.8 


94919_NCI-H146_Small cell lung 
cancer/neuroendocrine sscDNA 


0.0 


94920_NCI-H526_Small cell lung 
cancer/neuroendocrine sscDNA 


14.6 


94921_NCI-N417_Small cell lung 
cancer/neuroendocrine sscDNA 


0.0 
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94923 JsO-H82_Small cell lung 
canceE/neuroendocrine_sscDNA 


0.2 


94924 JSO-H157_Squamous cell lung cancer 
(metastasis) sscDNA 


0.0 


94925_NC1-H1 1 55_Large cell lung 
cancer/neuroendocrine sscDNA 


0.0 


94926_NCI-H1299_Large cell lung 
cane cl/h ciiroenQocnnc sscl/in/\ 


0.0 


94997 NPT-TT797 T nnu rarcrnoin ssrHNA 


\JmJ 


9499 R NPMTA/fP-l 1 T una carcinoid qqcDNA 

i > V_/i KJ 1Y1V^ 1 1 l_jUUg WCllVllllSlU OoH/IN/a 


\Jm\J 


94999 T Y-1 ^mall cpII luntr cancpr qqpDNA 
" T/i -/ i on ion tcu lung i/aiivvd ssovJ^/iix\. 




949^0 Po1o-9fK Polon cancer <?scDNA 


0 0 


949^ 1 TCTM1 9 Polon cancer cfirTYW A 


n o 


949^9 JCM70T 9 Polon cancer cccDWA 


n o 

\Jw\J 


94913 NPT-H716 Polon cancer <sseD*NA 




9493^ RW-4R Colon adenocarcinoma ^seTYWA 


0 1 


94936 SW1 116 Polon adenocarcinoma ^vDNA 

-/~S-}\J VJ YV 1 1 IV V^VJ 1V711 aVJ-wlUVCll 1/lllUllla J Jlvl^/l l.f\ 


00 




u.u 


9493 R ^W-94R Polon nHenncarpfnoma QQcrVMA 

7t7JO O ¥V "7tO v^vSlxJil aU.vlltJL/dlL'lllUiXla aaVv.L71i.r\. 




94939 Polon adenornrrinomn qqpTI^JA 

7T/J7 O YV ^^0\/ vAJlUll aUCliUwctJLvllll/lllci OaVy.L/rN/A. 


0 0 


94940 MPr-^MT T-S fra^rric carcinoma ^cTYNTA 

✓ ~^"v liv/1 Jil w *J VJooU.lt L-alL/LUAJllla oo^LJlyix 


0 9 


94941 "RTATnTTT fra<srric carcinoma ccpTVMA 


fl 4 


94943 XrPT-^TsJT T-.1 f\ OaQtrie carrfnoma ocrTYNJA 
7*T7*tj i>Vvi oil L7— ivj vja.ou.1^ caiuiiiuiiid aoUL/i>i/\ 


fl fl 


94944 NPI-SNTI-1 Ga<dric carcinoma qRcTCNA 

> ~7 i I livl Oil \J 1 VJfloUlv UiXlwlllUlllM- 


0 0 


94Q46 T?T?»1 P-nc+rip q Hf»Ti r\r»s* T*r*iTir\TYi a ccr»TYWA 
y*ty*i u .rvr i vjtiD in t» auciiuuai t-ixiuma sot/i^i>j/\. 


1 S 9 ! 


94947 "RT7-4R frac+rif* ar1f»nr%r*5iT"rM'nr»m?» ccr^rVKTA 
7*t7t /_JVr***tO_vja.E>U.lv aUvllUvalwlliUUlii_D5Vviyii(/\ 


1 8 7 
1 o. / 


96778_MKN-45_Gastric carcinoma_sscDNA 


0.0 


94949_NCI-N87__Gastric carcinoma_sscDNA 


0.0 


94951_0VCAR-5_0vanan carcinoma_sscDNA 


0.0 


94952 J*X95-2JJterine carcinoma__sscDNA 


0.0 


94953_HelaS3_Cervical adenocarcinoma_sscDNA 


0.0 


94954_Ca Ski_Cervical epidermoid carcinoma 
(metastasis)_sscDNA 


0.2 


94955_ES-2_Ovarian clear cell carcinoma_sscDNA 


0.0 


94957_Ramos/6h stim_"; Stimulated with PMA/ionomycin 
6h sscDNA 


14.7 


94958_Ramos/14h stim_ M ; Stimulated with PMA/ionomycin 
14h sscDNA 


22.7 


94962_MEG-01_Chronic myelogenous leukemia 

fin pfrnVnrvnhl a Qt - ^ co^TYKTA 

^lUCgUAill jrVJUldolJ ooWJL/lNrt. 


0.5 


94963_Raji_Burkitt ! s lymphoma_sscDNA 


32.3 


94964_Daudi_Burkitt t s lynuphoma_sscDNA 


100.0 


94965_U266_B-cellplasmacytoma/myeloma_sscDNA 


0.3 


94968__CA46_Burldtt , s lymphoma_sscDNA 


22.5 


94970_RL_non-Hodgkin , s B-cell lymphoma__sscDNA 


29.5 
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94972 JM1 pre-B-cell lymphoma/1 eukeroia sscDNA 


23.5 


94973 Jurkat T cell leukemia sscDNA 


2.2 


94974 TF-1 Erythroleukemia sscDNA 


0.3 


94975 HUT 78 T-cell lymphoma sscDNA 


1.6 


94977 U937 Histiocytic lymphoma sscDNA 


7.3 


94980 KU-812 Mveloeenous leukemia sscDNA 


0.1 


94981 769-P Clear cell renal carcinoma sscDNA 


0.2 


94983_Caki-2_CIeax cell renal carcinoma_sscDNA 


U.i 


94984JSW 839_CIear cell renal carcinomajjscDNA 


1.0 


94986_G401_Wilms , tumor_sscDNA 


0.0 


94987_Hs766T_Pancreatic carcinoma (LN metastasis)_sscDNA 


0.1 


94988_CAPAN-l_Pancreatic adenocarcinoma (liver 
metastasis)_sscDNA 


0.0 


94989_SU86.86_Pancreatic carcinoma (liver 
metastasis)_sscDNA 


0.0 


94990_BxPC-3Pancreatic adenocarcmoma_sscDNA 


a 1 

0.1 


94991 HPAC Pancreatic adenocarcinoma sscDNA 


A 1 

0.1 


94992 MIA PaCa-2 Pancreatic carcinoma sscDNA 


A A 

0.0 


94993_CFPAC-l_Pancreatic ductal adenocarcinoma_sscDNA 


0.2 


94994_PAN01_Pancreatic epithelioid ductal 

CaJVluuIIia aoLriyi\/\ 


a a 
0.0 


94996 T24 BlarldeT earcinma f transitional celH sscDNA 


0.0 


94997 5637 Bladder carcinoma sscDNA 


0.0 


94998 HT-1197 Bladder carcinoma sscDNA 


0.0 


94999_UM-UC-3_Bladder carcinma (transitional cell)_sscDNA 


0.0 


95000_A2Q4_Rnabdomyosarcoma_sscDNA 


A A 

0.0 


9500 1_HT- 1 0 80_Fibrosarcoma_sscDNA 


A A 

0.0 


95 002_MG-63_Osteosarcoma (bone)_sscDNA 


A 1 

0.1 


95003 JSKrLMS- 1 ^Leiomyosarcoma (vulva)_ sscDNA 


0.2 


95004 SJRH30_Rhabdomyosarcoma (met to bone 


A A 

0.0 


OSOfl^ A4*^1 TfniHfMmfiiH carcinoma sscDNA 


0.0 


Q5007 "WM266-4 Melanoma sscDNA 


0.0 


95010 DU 145 Prostate carcinoma (brain metastasis) sscDNA 


0.0 


9501 2_MD A-MB-46 8_Breast adenocarcinoma^sscDNA 


0.0 


95013_SCC-4_Squamous cell carcinoma of tongue_sscDNA 


0.0 ■ 


95014_SCC-9_Squamous cell carcinoma of tongue_sscDNA 


0.3 


95015_SCC-15__Squamous cell carcinoma of tonguesscDNA 


0.0 


95017_CAL 27_Squamous cell carcinoma of tongue_sscDNA 


0.0 



The RTQ-PCR analysis (Table 13 and 14) reveals that MOL1 is predominantly expressed 
in cell lines derived from lymphoma and leukemia, specifically Burkitfs lymphoma in panel 3D. 
This result is supported by the presence of GenBank ESTs coining from T cells from T cell 
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leukemia (see Unigene http://www.ncbi.nlm.nih.gov/U^ 

Recent report indicates that this receptor nomally aids the immune cells to sense the presence of 
unmethylated CpG dinucleotides (Hemmi H, Takeuchi O, Kawai T, Kaisho T, Sato S, Sanjo H, 
Matsumoto M, Hoshino K, Wagner H, Takeda K, Akira S. A Toll-like receptor recognizes 
5 bacteria] DNA. Nature 2000 Dec 7;408(6813):740-5) and to induce proliferation of splenocytes, 
inflammatory cytokine production from macrophages and maturation of dendritic cells. The 
signaling pathway mediated by toll-like receptor 9 is through the activation of NF-kB. There is 
evidence that NF-kappaB activity is necessary for survival of lymphoma and leukemia cells 
(Constitutive activation of NF-kappaB in primary adult T-cell leukemia cells. Mori N, Fujii M, 

10 Ikeda S, Yamada Y, Tomonaga M, Ballard DW, Yamamoto N. Blood 1 999 Apr 1 ;93(7):2360-8; 
Inhibition of NF-kappaB induces apoptosis of KSHV-infected primary effusion lymphoma cells. 
Keller SA, Schattner EJ, Cesarman E. Blood, 1 October 2000, 96, No. 7, pp. 2537-2542). 
Overexpression of toll-like receptor 9 by lymphoma and leukemia cells is likely to mediate 
ligand-independent signaling, affecting the normal processes of activation, proliferation and. 

1 5 tumorogenesis. Therefore the protein encoded (GMJ79960 1 78) may serve as a potential marker 
for lymphoma and leukemia cells. In addition, human monoclonal antibodies directed against 
this protein could be therapeutics for the treatment of lymphoma and leukemia. 

MOL2 

20 Expression of gene MOL2 was assessed using the primer-probe set Ag743, described in 

Table 15. Results of the RTQ-PCR runs are shown in Table 16. 



Table 15. Probe Name: Ag743 



Primers 


Sequences 


Tm 


Lengt 
h 


Start 
Position 


SEQID 
NO: 


Forward 


5'-ATGTCTTGGTGGATGCAGAA-3 ' 


59.1 


20 


1304 


79 


Probe 


TET-5 1 - 

CGGACTATAGCATTTCTAAGCGCCTCG- 
3-TAMRA 


69.2 


27 


1340 


80 


Reverse 


5'-CACATCCTCCTTGCAAATGT-3' 


58.6 


20 


1370 


81 



25 

Table 16. Panels 13D and 4D 



PANEL 1.3 D 




PANEL 4D 
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Tissue Name 


Rel. Expr., % 
1.3dx4tm5604t ag743 
al 


Tissue Name 


Rel. Expr., % 
4Dtm2477t ag7 
43 


Liver 

adenocarcino 
ma 


36.4 


93768_Secondary Thl_anti- 
CD28/anti-CD3 


27.2 


Pancreas 


4.6 


93769_Secondary Th2_anti- 
CD28/anti-CD3 


19.6 


Pancreatic ca. 
CAP AN 2 


16.6 


93770 JSecondaryTrl_anti- 
CD28/anti-CD3 


23.2 


Adrenal gland 


6.5 


93573_Secondary Thl_resting 
day 4-6 in EL-2 


7.2 


Thyroid 


5.2 


93572_Secondary Th2_resting 
day 4-6 in EL-2 


7.1 


Salivary gland 


12.0 


93571_Secondary Trl_resting 
day 4-6 in IL-2 


11.9 


Pituitary gland 


22.7 


93568_primary Thl_anti- 
CD28/anti-CD3 


43.2 


Brain (fetal) 


8.4 


93569_primary Th2_anti- 
CD28/anti-CD3 


32.1 


Brain (whole) 


38.6 


93570_primary Trl_anti- 
CD28/anti-CD3 


50.0 


Brain 

(amygdala) 


24.8 


93565jprimary Thl_resting dy 
4-6 in EL-2 


43.8 


Brain 

(cerebellum) 


31.4 


93566_primary Th2_resting dy 
4-6 in IL-2 


16.4 


Brain 

(hippocampus) 


22.0 


93567_primary Trl_resting dy 
4-6 in EL-2 


22.4 


Brain 

(substantia 

nigra) 


11.8 


93351_CD45RACD4 
lymphocyte anti-CD28/anti- 
CD3 


23.0 


Brain 
(thalamus) 


30.3 


93352_CD45RO CD4 
lymphocyte anti-CD28/anti- 
CD3 


25.5 


Cerebral 
Cortex 


22.0 


93251_CD8 Lymphocytes_anti- 
CD28/anti-CD3 


16.0 


Spinal cord 


20.6 


93353_chronic CD8 
Lymphocytes 2ry resting dy 4-6 
inIL-2 


15.0 


CNS ca. 

(glio/astro) 

U87-MG 


28.1 


93574_chronic CD8 
Lymphocytes 2ry activated 
CD3/CD28 


10.2 


CNS ca. 
( elio/astro^ 
U-118-MG 


27.9 


93354_CD4_none 


4.0 


CNSca. 

(astro) 

SW1783 


31.1 


93252 Secondary 
Thl/Th2/Trl_anti-CD95 CH11 


9.6 


CNSca* 


19.3 


93 1 03_LAK cell3_resting 


12.9 
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(neuro; met ) 
SK-N-AS 








CNS ca. 

(astro) 

SF-539 


47.2 


93788_LAKcells_IL-2 


14.9 


CNSca. 

(astro) 

SNB-75 


10.2 


93787_LAK cellsJL-2+IL-12 


12.5 


CNS ca. (glio) 
SNB-19 


24.3 


93789JLAK cells_IL-2+IFN 
oamma 


22.2 


CNS ca. (glio) 
U251 


20.9 


93790_LAK cells_IL-2+ IL-1 8 


15.7 | 


CNS ca. (glio) 
SF-295 


10.7 


93104_LAK 

cells PMA/ionomycin and IL-1 8 


4.0 


Heart (fetal) 


2.0 


93578_NK Cells IL-2_resting 


12.0 


Heart 


6.5 


93 109_Mixed Lymphocyte 
Reaction Two Way MLR 


9.8 


Fetal Skeletal 


0.7 


93 1 10 JMixed Lymphocyte 
Reaction Two Way MLR 


14.3 


Skeletal 
muscle 


18.3 


93 1 1 l_Mixed Lymphocyte 
Reaction Two Way MLR 


11.7 


Bone marrow 


11.2 


931 12_Mononuclear Cells 
(PBMCs) resting 


5.8 


Thymus 


7.8 


93 1 13_Mononuclear Cells 
CPBMCs^ PWM 


51.8 


Spleen 


7.2 


931 14_Mononuclear Cells 
(PBMCs)_PHA-L 


27.5 


Lymph node 


7.5 


93249_Ramos (B cell)_none 


0.2 


Colorectal 


28.4 


93250_Ramos (B 
cell) ionomycin 


6.7 


Stomach 


6 5 




100 0 


Small intestine 


10.0 


93350JB lymphoytes_CD40L 
andIL-4 


23.7 


Colon ca. 
SW480 


23.2 


92665_EOL-l 

(Eosinophil)_dbcAMP 

difPerentiated 


15.5 


Colon ca.* 

(SW480 

met)SW620 


12.3 


93248_EOL-l 

(Eosinophil)_dbcAMP/PMAion 
omycin 


10.4 


Colon ca. 
HT29 


1.7 


93356_Dendritic Cellsnone 


15.3 


Colon ca. 
HCT-116 


13.1 


93355_Dendritic Cells_LPS 100 
ng/ml 


6.7 


Colon ca. 
CaCo-2 


12.5 


93775 Dendritic Cells anti- 
CD40~ 


16.7 


83219 CC 
Well to Mod 


10.4 


93774_Monocytes_resting 


14.9 
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Diff 

(OD03866) 








Colon ca. 
HCC-2998 


11.3 


93776 JMonocytesJLPS 50 
ng/ml 


4.0 


Gastric ca.* 
(liver met) 
NCI-N87 


17.6 


9358 1 Macrophagesjresting 


19.5 


Bladder 


17.6 


93582_Macrophages_LPS 100 
ng/ml 


7.5 


Trachea 


7.6 


93098_HUVEC 
(Endotheliai)_ none 


33.0 


Kidney 


5.7 


93099 JIUVEC 
(Endothelial)_starved 


66.0 


Kidney (fetal) 


6.4 


93 100 JIUVEC 
(Endotlielial)_IL-lb 


29.3 


Renal ca. 
786-0 


10.0 


93779_HUVEC 
(Endothelial) IFN gamma 


33.0 


Renal ca. 
A498 


23.6 


93102_HUVEC 

(Endothelial) JTNF alpha + IFN 
gamma 


23.0 


Renal ca. 
RXF393 


37.8 


93101JIUVEC 

(Endomelial) JTNF alpha + EL4 


19.9 


Renal ca. 
ACHN 


18.1 


93781_HUVEC 
(Endothelial)_IL-ll 


22.4 


Renal ca. 
UO-31 


26.1 


93583_Lung Microvascular 
Endothelial Cells none 


27.5 


Renal ca. 
TK-10 


10.9 


93 584_Lung Microvascular i 
Endothelial CellsJTNFa (4 
ng/ml) and ILlb (1 ng/ml) 


25.2 


Liver 


2.2 


92662_Microvascular Dermal 
endothelium none 


51.4 


Liver (fetal) 


4.4 


92663_Microsvasular Dermal 
endothelium^TNFa (4 ng/ml) 
and ILlb (1 ng/ml) 


22.5 


Liver ca. 

(hepatoblast) 

HepG2 


28.1 


93773_Bronchial 

epithelium TNFa (4 ng/ml) and 

ILlb (1 ng/ml) ** 


17.0 


Lung 


12.4 


93347__Small Airway 
Epitheliumnone 


20.9 


Lung (fetal) 


5.2 


93348_Small Airway 
Epithelium TNFa (4 ng/ml) and 
ILlb (1 ng/ml) 


84.7 


Lungca. 

(Small Cell ) 

LX-1 


10.8 


92668_Coronery Artery 


37.9 


Lung ca. 
(small cell) 
NCI-H69 


20.2 


92669 Coronery Artery 

SMC TNFa (4 ng/ml) and ILlb 

(1 ng/ml) 


26.8 


Lung ca. 
/~ — it — \ 


20.1 


93 107_astrocytes_resting 


47.6 
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(s.cell var.) 
SHP-77 








Lung ca, 
(large 
cell)NCI- 
H460 


10.7 


93108_astrocytes_TNFa (4 
ng/ml) and ILlb (1 ng/ml) 


20.0 


Lung ca. (non- 
sm. cell) A549 


5.5 


92666JCU-812 
(Basophil) resting 


25.0 


Lung ca. (non- 
s.cell) NCI- 
H23 


132 


92667_KU-812 
(Basophil)_PMA/ionoycin 


31.9 


Lung ca (non- 
s.cell) HOP- 
62 


18.3 


93579_CCD1106 
(Keratinocytes)_none 


47.0 


Lung ca. (non- 
s.cl) NCI- 
H522 


13.8 


93580_CCD1106 
(Keratinocytes)_TNFa and IFNg 
** 


7.2 


Lungca. 
(squam.) 
SW900 


17.2 


93791_Liver Cirrhosis 


4.2 


Lung ca. 
(squam.) 
NCI-H596 


20.9 


93792_Lupus Kidney 


3.8 


Mammary 
gland 


17.8 


93577_NCI-H292 


55.1 


Breast ca.* 
(pi. effusion) 
MCF-7 


20.4 


93358_NCI-H292JL-4 


68.8 


Breast ca.* 
(pl.ef) MDA- 
MB-231 


22.5 


93360_NQ-H292_IL-9 


59.5 


Breast ca.* 
(pi. effusion) 
T47D 


15.5 


93359_NCI-H292_IL-13 


35.1 


Breast ca. 
BT-549 


13.9 


93357_NCI-H292_IFN gamma 


37.9 


Breast ca. 
MDA-N 


9.9 


93777_HPAEC_- 


32.3 


Ovary 


5.6 


93778_HPAEC_IL-1 beta/TNA 
alpha 


23.2 


Ovarian ca. 
OVCAR-3 


15.4 


93254_Normal Human Lung 
Fibroblast none 


29.7 


Ovarian ca. 
OVCAR-4 


12.4 


93253_Normal Human Lung 
Fibroblast TNFa (4 ng/ml) and 
EL-lb (1 ng/ml) 


16.8 


Ovarian ca. 
OVCAR-5 


27.2 


93257_Nonnal Human Lung 
Fibroblast EL-4 


55.5 


Ovarian ca. 
OVCAR-8 


9.0 


93256_Normal Human Lung 
Fibroblast IL-9 


34.9 



160 



WO 01/81578 



PCT/US01/13578 



Ovarian ca. 
IGROV-1 


22.6 


93255_Norrnal Human Lung 
Fibroblast IL-13 


36.9 


Ovarian ca * 
(ascites) SK- 
OV-3 


100.0 


93258_Normal Human Lung 
FibroblastJFN gamma 


58.6 


Uterus 


6.1 


93106_Dermal Fibroblasts 
CCD1070_resting 


74.2 


Plancenta 


15.8 


93361_Dermal Fibroblasts 
CCD1070 TNF alpha 4 ng/ml 


64.2 


Prostate 


5.6 


93105 Dermal Fibroblasts 
CCD1070JL-1 beta 1 ng/ml . 


27.7 


Prostate ca,* 
(bone met)PC- 
3 


50.1 


93772_dermal fibroblast IFN 
gamma 


24.0 


Testis 


8.3 


93771 dermal fibroblast EL-4 


58.2 


Melanoma 
Hs688(A).T 


12.6 


93259_BD Colitis 1** 


2.8 


Melanoma* 
(met) 

Hs688(B).T 


14.5 


93260_IBD Colitis 2 


1.1 


Melanoma 
UACC-62 


18.8 


93261_IBD Crohns 


2.1 


Melanoma 
M14 


19.7 


735010_Colon_normal 


14.3 


Melanoma 
LOXIMVI 


8.8 


73501 9_Lung_none 


21.5 


Melanoma* 
(met) SK- 
MEL-5 


11.0 


64028- l_Thymus_none 


27.0 


Adipose 


14.5 


64030- l_Kidney_none 


31.2 



MOL2 is widely expressed in tissues and cell lines represented in both panels 1 .3D and 
4D, with highest expression being in one ovarian cancer cell line (SK-OV-3). Thus, it could 
serve as a diagnostic marker for ovarian cancer. 



MOL3 

Expression of MOL3 was assessed using the primer-probe sets Ag474 and Ag770, 
described in Tables 17 and 18. Results of the RTQ-PCR runs are shown in Tables 19 and 20. 

10 Table 17, Probe name: Ag474 
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Primers 


Sequences 


Length 


Start 
Position 


SEQID 

NO: 


Forward 


5'-GGCACTGTCCCTCTGCACAT-3' 


20 


493 


82 


Probe 


FAM-5'- 

CCCTGAGAAAGATCTGCCACAAAGACATCT 
G-3-TAMRA 


31 


516 


83 


Reverse 


5'-AACCTGCCCACAGAGCAATC-3' 


20 


549 


84 



Table 18. Probe name: Ag770 



Primers 


Sequences 


Tm 


Lengt 
h 


Start 

Positio 

n 


Forward 


5'-ACAGTGCTGTCGCTGGTACTT-3' 


60 


21 


717 


Probe 


FAM-5'- 

TTCGTACTCAAAGGCGTACCTCTCCA-3- 
TAMRA 


67.9 


26 


746 


Reverse 


5'-CTCAAACAGCTCACGAGTGAT-3' 


58.1 


21 


773 



Table 19. Ag474 



PANEL 1.3D 




Panel 2D 




Tissue Name 


Rel. Expr., % 
1.3Dtm3254f_ag474 


Tissue Name 


Rel. Expr., % 
2Dtm3255f ag47 
4 


Liver 

adenocarcinoma 


0.0 


Normal Colon GENPAK 
061003 


0.3 


Pancreas 


0.0 


83219 CC Well to Mod 
Diff(OD03866) 


0.0 


Pancreatic ca. 
CAPAN2 


0.0 


83220 CC NAT 
(OD03866) 


0.0 


Adrenal gland 


0.3 


83221 CC Gr.2 
rectosigmoid (OD03868) 


0.0 


Thyroid 


0.0 


83222 CCNAT 
(OD03868) 


0.0 


Salivary gland 


0.0 


83235 CC Mod Diff 
(ODO3920) 


0.0 


Pituitary gland 


0.0 


83236 CC NAT 
(ODO3920) 


0.0 


Brain (fetal) 


0.2 


83237 CC Gr.2 ascend 
colon (OD03921) 


0.0 


Brain (whole) 


0.0 


83238 CCNAT 
(OD03921) 


0.0 


Brain (amygdala) 


0.1 


83241 CC from Partial 
Hepatectomy (ODO4309) 


3.8 


Brain (cerebellum) 


0.0 


83242 Liver NAT 
(ODO4309) 


100.0 
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Brain 

(hippocampus) 


0.0 


87472 Colon mets to lung 
(OD04451-01) 


0.0 


Brain (substantia 
nigra) 


0.0 


87473 Lung NAT 
(OD04451-02) 


0.0 


Brain (thalamus) 


0.0 


Normal Prostate Clontech 
A+ 6546-1 


0.0 


Cerebral Cortex 


0.1 


84140 Prostate Cancer 
(OD04410) 


0.0 


Spinal cord 


0.0 


84141 Prostate NAT 
(OD04410) 


0.0 


CNS ca. (glio/astro) 
U87-MG 


0.3 


87073 Prostate Cancer 
(OD04720-01) 


0.0 


CNSca. 

(glio/astro) U-118- 
MG 


0.0 


87074 Prostate NAT 
(OD04720-02) 


0.0 


CNS ca. (astro) 
SW1783 


0.0 


Normal Lung GENPAK 
061010 


0.0 


CNS ca.* (neuro; 
met)SK-N-AS 


0.0 


83239 Lung Met to Muscle 
(OD04286) 


0.0 


CNS ca. (astro) 
SF-539 


0.0 


83240 Muscle NAT 
(OD04286) 


0.0 


CNS ca. (astro) 
SNB-75 


0.0 


84136 Lung Malignant 
Cancer (OD03 126) 


0.0 


CNS ca. (glio) 
SNB-19 


0.0 


84137 Lung NAT 
(OD03126) 


0.0 


CNS ca. (glio) 
U251 


0.0 


84871 Lung Cancer 
(OD04404) 


0.0 


CNS ca. (glio) 
SF-295 


0.0 


84872 Lung NAT 
(OD04404) 


0.0 


Heart (fetal) 


0.0 


84875 Lung Cancer 
(OD04565) 


0.0 


Heart 


0.0 


84876 Lung NAT 
(OD04565) 


0.0 


Fetal Skeletal 


0.2 


85950 Lung Cancer 
(OD04237-01) 


0.0 


Skeletal muscle 


0.0 


85970 Lung NAT 
(OD04237-02) 


0.0 


Bone marrow 


0.0 


83255 Ocular Mel Met to 
Liver (ODO4310) 


0.0 


Thymus 


0.0 


83256 Liver NAT 
(ODO4310) 


64.2 


Spleen 


0.2 


84139 Melanoma Mets to 
Lung (OD04321) 


0.0 


Lymph node 


0.0 


84138 Lung NAT 
(OD04321) 


0.0 


Colorectal 


0.0 


Normal Kidney GENPAK 
061008 


0.0 


Stomach 


0.1 


83786 Kidney Ca, 
Nuclear grade 2 


0.0 
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(OD04338) 




Small intestine 


0.0 


83787 Kidney NAT 
(OD04338) 


0.0 


Colon ca. 
SW480 


0.3 


83788 Kidney Ca Nuclear 
grade 1/2 (OD04339) 


0.0 


Colon ca.*(SW480 
met)SW620 


0.0 


83789 Kidney NAT 
(OD04339) 


0.0 


Colon ca. 
HT29 


0.0 


83790 Kidney Ca, Clear 
cell type (OD04340) 


0.1 


Colon ca. 
HCT-116 


0.0 


83791 Kidney NAT 
(OD04340) 


0.0 


Colon ca. 
CaCo-2 


0.3 


83792 Kidney Ca, Nuclear 
grade3(OD04348) 


0.0 


83219 CC Well to 

ModDiff 

(OD03866) 


0.3 


83793 Kidney NAT 
(OD04348) 


0.0 


Colon ca. 
HCC-2998 


0.5 


87474 Kidney Cancer 
(OD04622-01) 


0.0 


Gastric ca.* (liver 
met) NCI-N87 


0.0 


87475 Kidney NAT 
(OD04622-03) 


0.0 


Bladder 


0.0 


85973 Kidney Cancer 
(OD04450-01) 


0.0 


Trachea 


0.0 


85974 Kidney NAT 
(OD04450-03) 


0.1 


Kidney 


0.0 


Kidney Cancer Clontech 
8120607 


0.0 


Kidney (fetal) 


0.0 


Kidney NAT Clontech 
8120608 


0.0 


Renal ca. 
786-0 


0.0 


Kidney Cancer Clontech 
8120613 


0.0 


Renal ca. 
A498 


0.0 


Kidney NAT Clontech 
8120614 


0.0 


Renal ca. 
RXF393 


0.0 


Kidney Cancer Clontech 
9010320 


0.0 


Renal ca. 
ACHN 


0.0 


Kidney NAT Clontech 
9010321 


0.0 . 


Renal ca. 
UO-31 


0.3 


Normal Uterus GENPAK 
061018 


0.0 


Renal ca. . 
TK-10 


0.1 


Uterus Cancer GENPAK 
064011 


0.0 


Liver 


100.0 


Normal Thyroid Clontech 
A+ 6570-1 


0.0 


Liver (fetal) 


1.8 


Thyroid Cancer GENPAK 
064010 


0.0 


Liver ca. 

(hepatoblast) 

HepG2 


0.0 


Thyroid Cancer 
INVTTROGEN A302152 


0.0 


Lung 


0.0 


Thyroid NAT 
INVTTROGEN A302153 


0.0 
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Lung (fetal) 


0.0 


Normal Breast GENPAK 
061019 


0.0 


Lung ca/(small 
cell) LX-1 


0.0 


84877 Breast Cancer 
(OD04566) 


0.0 


Lung ca. (small 
cell) NCI-H69 


0.0 


85975 Breast Cancer 
(OD04590-01) 


0.0 


Lung ca. (s.cell 
var.) SHP-77 


0.0 


85976 Breast Cancer Mets 
(OD04590-03) 


0.0 


Lung ca. (large 
celI)NCI-H460 


0.0 


87070 Breast Cancer 
Metastasis (OD04655-05) 


0.0 


Lung ca. (non-sm. 
cell) A549 


0.1 


GENPAK Breast Cancer 
064006 


0.2 


Lung ca. (non- 
s.cell)NCI-H23 


0.2 


Breast Cancer Res. Gen. 
1024 


0.2 


Lung ca (non-s.cell) 
HOP-62 


0.0 


Breast Cancer Clontech 
9100266 


0.0 


Lung ca. (non-s.cl) 
NCI-H522 


0.0 


Breast NAT Clontech 
9100265 


0.0 


Lung ca, (squam.) 
SW900 


0.0 


Breast Cancer 
INVITROGENA209073 


0.0 


Lung ca. (squam.) 
NCI-H596 


0.0 


Breast NAT 

3NVITROGEN A2090734 


0.0 


Mammary gland 


0.0 


Normal Liver GENPAK 
061009 


38.4 


Breast ca.* (pi. 
effusion) MCF-7 


0.0 


Liver Cancer GENPAK 
064003 


49.0 


Breast ca.* (pl.ef) 
MDA-MB-231 


0.0 


Liver Cancer Research 
Genetics RNA 1025 


62.4 


Breast ca.* (pi. 
effusion) T47D 


0.0 


Liver Cancer Research 
Genetics RNA 1026 ! 


12.9 


Breast ca. 
BT-549 


0.0 


Paired Liver Cancer Tissue 
Research Genetics RNA 
6004-T 


74.2 


Breast ca. 
MDA-N 


0.1 


Paired Liver Tissue 
Research Genetics RNA 
6004-N 


8.5 


Ovary 


0.0 


Paired Liver Cancer Tissue 
Research Genetics RNA 
6005-T 


5.0 


Ovarian ca. 
OVCAR-3 


0.5 


Paired Liver Tissue 
Research Genetics RNA 
6005-N 


18.6 

- 


Ovarian ca. 
OVCAR-4 


0.0 


Normal Bladder GENPAK 
061001 


0.0 


Ovarian ca. 
OVCAR-5 


0.0 


Bladder Cancer Research 
Genetics RNA 1023 


0.0 


Ovarian ca. 
OVCAR-8 


0.0 


Bladder Cancer 
INVITROGENA302173 


0.0 


Ovarian ca. 


0.0 


87071 Bladder Cancer 


0.0 
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IGROV-l 




(OD04718-01) 




Ovarian ca.* 
(ascites) SK-OV-3 


0.0 


87072 Bladder Normal 
Adjacent (OD04718-03) 


0.0 


Uterus 


0.0 


Normal Ovary Res. Gen. 


0.0 


Placenta 


0.0 


Ovarian Cancer GENPAK 
064008 


0.0 


Prostate 


0.1 


87492 Ovary Cancer 
(OD04768-07) 


0.1 


Prostate ca.* (bone 
met)PC-3 


0.0 


87493 Ovary NAT 
(OD04768-08) 


0.0 


Testis 


0.3 


Normal Stomach GENPAK 
061017 


0.0 


Melanoma 
Hs688(A).T 


0.0 


Gastric Cancer Clontech 
9060358 


0.0 


Melanoma* (met) 
Hs688(B).T 


0.0 


NAT Stomach Clontech 
9060359 


0.0 


Melanoma 
UACC-62 


0.0 


Gastric Cancer Clontech 
9060395 


0.4 


Melanoma 
M14 


0.0 


NAT Stomach Clontech 
9060394 


0.0 


Melanoma 
LOXIMVI 


0.0 


Gastric Cancer Clontech 
9060397 


0.3 


Melanoma* (met) 
SK-MEL-5 


0.0 


NAT Stomach Clontech 
9060396 


0.0 


Adipose 


0.0 


Gastric Cancer GENPAK 
064005 


0.0 



Table 20. Ag770 



PANEL 1.3D 




PANEL 4D 






Tissue Name 


Rel. Expr., % 
1.3dx4tm5495 
f_ag770_b2 


Tissue Name 


Rel. Expr., % 
4dtml843f ag7 
70 


Rel. Expr., 
% 

4Dtml910f 
ag770 


Liver 

adenocarcinoma 


0.0 


93768_Secondary 

Th l_anti-CD2 8/anti-CD3 


0.0 


0.0 


Pancreas 


0.0 


93769_Secondary 
Th2_anti-CD28/anti-CD3 


6.4 


0.0 


Pancreatic ca. 
CAP AN 2 


0.0 


93770_Secondary 
Trl_anti-CD28/anti-CD3 


3.1 


0.0 


Adrenal gland 


0.0 


93573_Secondary 

Thl resting day 4-6 in EL- 

2 


0.0 


0.0 


Thyroid 


0.0 


93572_Secondary 
Th2_resting day 4-6 in IL- 
2 


0.0 


0.0 


Salivary gland 


0.0 


93571_Secondary 
Trl_resting day 4-6 in IL-2 


0.0 


0.0 
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Pituitary gland 


0.0 


93568 jrimary Thl_anti- 
CD28/anti-€D3 


3.7 


0.0 


Brain (fetal) 


0.0 


93569_primary Th2 anti- 
CD28/anti-CD3 


0.0 


0.0 


Brain (whole) 


0.0 


93570_primary Trl_anti- 
CD28/anti-CD3 


0.0 


0.0 


Brain 

(amygdala) 


0.0 


93565j?rimary 
Thl_resting dy 4-6 in IL-2 


0.0 


4.8 


Brain 

(cerebellum) 


0.2 


93566_primary 
Tli2_resting dy 4-6 in IL-2 


0.0 


2.2 


Brain 

(hippocampus) 


0.1 


93567 j>rimary Trljresting 
dy 4-6 in IL-2 


0.0 


7.0 


Brain 

(substantia 

nigra) 


0.0 


93351_CD45RA CD4 

lymphocyte_anti- 

CD28/anti-CD3 


3.1 


0.0 


Brain 
(thalamus) 


0.0 


93352_CD45RO CD4 

lymphocyte_anti- 

CD28/anti-CD3 


0.0 


0.0 


Cerebral Cortex 


0.0 


93251J3D8 
Lymphocytes anti- 
CD28/anti-CD3 


1.4 


0.0 


Spinal cord 


0.0 


93353_chronic CD8 
Lymphocytes 2ry_resting 
day 4-6 in IL-2 


0.0 


0.0 


CNS ca. 

(glio/astro) 

U87-MG 


0.0 


93574_chronic CD8 

Lymphocytes 

2ry activated CD3/CD28 


0.0 


0.0 


CNS ca. 
(glio/astro) U- 
118-MG 


0.0 


93354_CD4_none 


0.0 


0.0 


CNS ca. (astro) 
SW1783 


0.0 


93252_Secondary 
Thl/Th2/Trl anti-CD95 
CH11 


0.0 


41.2 


CNSca.* 
(neuro; met ) 
SK-N-AS 


0.0 


93103JLAK cells j-esting 


0.0 


0.0 


CNS ca. (astro) 
SF-539 


0.0 


93788 JLAK cells JL-2 


0.0 


0.0 


CNS ca. (astro) 
SNB-75 


0.0 


93787 LAK cells IL- 
2+IL-12 


0.0 


0.0 


CNS ca. (glio) 
SNB-19 


0.0 


93789_LAKcells_IL- 
2+IFN gamma 


0.0 


0.0 


CNS ca. (glio) 
U251 


0.0 


93790 LAK cells IL-2+ 
IL-18 


0.0 


0.0 


CNS ca. (glio) 
SF-295 


0.0 


93104_LAK 

cells PMA/ionomycin and 
IL-18 


0.0 


5.9 
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xxCox i yi&ialj 


0.0 


93578 NIC Pells TT^ 
^resting 


0.0 


00 

v.v 


nctu i 


0.3 


01 1 00 MiYeH T vmnhnevte 

-7 J X V/ S 1 VXXA.V U X^jr IXXpXXVV J 

ReactionJTwo Way MLR 


6.0 


00 

V.v 


Fetal Skelpfal 


0.0 


011 1 0 Mixed T vrrmhncvte 

7J1 lv XVXXA.WU i-*JT IXXpXXV W jr lb 

ReactionJTwo Way MLR 


0.0 


00 

v.v 


Q1ri»1^t5*1 Tnncolia 
O&CICUU IlllldUC 




011 1 1 \jfiYF»H T vrrmVi nr vte 

XXX iVXXAOIX l^y lllyll\J\yj LO 

ReactionTwo Way MLR 


0 0 


1 3 

X »J 


duiic xxxaxxuw 


0 0 

V.V 


01117 A/fnnnnnrlftJir fella 

(PBMCs)_resting 


0 0 


18.4 


x xiy 1 1 1 no 


0 0 


011 1 1 A/fnnrmii clear PeTls 

7 J 1 1J XYXAJXlVlXUvlwcLX vvllo 

(PBMCs)_PWM 


00 


0.0 


Cm] A Aft 


0.0 


011 14 AiffinnrmoleaT PellQ 

(PBMCs)_PHA-L 


0 0 

V.V 


6.6 


ijyinpix iiuuc 


0 0 

v.v 


celI)_none 


5 7 


0.0 


Colorectal 


0.1 


93250_Ramos (B 
cell)_ionomycxn 


0.0 


0.0 


Stomach 


0.0 


93349_B 

lymphocytes_PWM 


0.0 


0.0 


Small intestine 


0.0 


93350JB 

lymphoytes CD40L and 
IL-4 ; 


0.0 


0.0 


Colon ca. 
SW480 


0.0 


92665_EOL-l 

(Eosinophil)_dbcAMP 

differentiated 


0.0 


3.2 


Colon ca * 

(SW480 

met)SW620 


0.2 


93248_EOL-l 

(x^smopM)_dbcAMP/rM 

Aionomycin 


0.0 


0.0 


fViInn 
v^vJIUil to. 

HT29 


00 

V.V 


011 Sfi DenHritie 

yJJJ\J XX/XXUX I UV 

CeDsjaone 


00 

V.V 


0.0 


Colon ca. 
HCT-116 


0.0 


93355_Dendritic 
CellsJLPo 100 ng/ml 


0.0 


0.0 


Colon ca. 
CaCo-2 


0.0 


93775 J)endritic 
Cells_anti-CD40 


0.0 


0.0 


83219 CCWell 
to Mod DirT 
(OD03866) 


0.0 


93774_Monocytesjresting 


0.0 


0.0 


Colon ca. 

linn onno 

HCC-2998 


1.3 


93776 JdonocytesJLPS 50 
ng/ml 


0.0 


0.0 


Gastric ca.* 

/Iiitw rr»<*A TVT/T 

(liver mei j x\oi~ 
N87 


0.2 


93581 _Macrophages_restin 


1.1 


0.0 


Bladder 


0.0 


93582 - Macrophages_LPS 
100 ng/ml 


0.0 


5.9 


Trachea 


0.0 


93098JIUVEC 
(Endothelial)_none 


0.0 


0.0 
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Kidney 


0 0 


QinQO T4TTVFP 
(Endothelial)_starved 


00 


0 0 


Kidney (fetal) 


0.0 


93100_HUVEC 
^Endothe hal)_IL- 1 b 


0.0 


0.0 


Renal ca. 
786-0 


0.0 


93779_HUVEC 
Endothelial)_IFN gamma 


0.0 


0.0 


Renal ca. 
A498 


0.5 


93102_HUVEC 
(EndotheHal) JTNF alpha + 
EFN gamma 


0.0 


0.0 


Renal ca. 
RXF393 


0.0 


93101JTOVEC 
(Endothelial) TNF alpha + 
DL4 


0.0 


0.0 


Renal ca. 
ACHN 


0.0 


93781_HUVEC 
(Endothehal)_IL-ll 


0.0 


0.0 


Renal ca. 
UO-31 


0.0 


93583 JLung 

Microvascular Endothelial 
Cells none 


0.0 


0.0 


Renal ca. 
1X-10 


0.0 


93584_Lung 

Microvascular Endothelial 
Cells TNFa (4 ng/ml) and 
ILlb (1 ng/ml) 


0.0 


0.0 


Liver 


100.0 


92662_Microvascular 
Dermal endothetium_none 


0.0 


0.0 


Liver (fetal) 


0.3 


92663_Microsvasular 
Dermal endothelium TNFa 
(4 ng/ml) and ILlb (1 
ng/ml) 


0.0 


0.0 


Liver ca. 

(hepatoblast) 

HepG2 


0.0 


93773_Bronchial 
epithelium TNFa (4 ng/ml) 
and ILlb (1 ng/ml)** 


0.0 


2.2 


Lung 


0.0 


93347_Small Airway 
Epithelium_none 


0.0 


0.0 


Lung (fetal) 


0.0 


93348_Small Airway 
EpitheliumJTNFa (4 
ng/ml) and ILlb (1 ng/ml) 


0.0 


3.7 


Lung ca. (small 
cell) LX-1 


0.0 


92668_Coronery Artery 
SMC_resting 


0.0 


0.0 


Lung ca. (small 
cell) NCI-H69 


0.0 


92669_Coronery Artery 
SMC TNFa (4 ng/ml) and 
ILlb (1 ng/ml) 


0.0 


0.0 


Lung ca. (s.celi 
var.) SHP-77 


u.u 


y j l u A — asuocyxes^resung 


ft 0 

U.v 


3 4 


T unff ca flarce 

cell)NCI-H460 


0.0 


93108 astrocvtes TNFa (4 
ng/ml)~and ILlb (1 ng/ml) 


0.0 


0.0 


Lung ca. (non- 
sm. cell) A549 


0.0 


92666JOJ-812 
(Basophil) jresting 


0.0 




Lung ca. (non- 
s.ceU)NQ-H23 


0.0 


92667JOJ-812 
(Basophil)JPMA/ionoycin 


0.0 


0.0 
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Lung ca (non- 
s.cell) HOP-62 


0.0 


93579 CCD1106 
(Keratinocytes)_none 


0.0 


0.0 


Lung ca. (non- 
s.cl)NCI-H522 


0.0 


93580_CCD1106 
(Keratinocytes) TNFaand 
JFNg ** 


0.0 


0.0 


Lung ca. 
(squam.) SW 
900 


0.5 


9379 l_Liver Cirrhosis 


100.0 


100.0 


Lungca. 
(squam.) NCI- 
H596 


0.0 


93792_Lupus Kidney 


0.0 


0.0 


Mammary gland 


0.0 


93577_NCI-H292 


0.0 


0.0 


Breast ca.* (pi. 
effusion) MCF- 
7 


0.0 


93358_NCI-H292_IL-4 


0.0 


0.0 


Breast ca.* 
(pl.ef) MDA- 
MB-231 


0.0 


p3360_NCI-H292_IL-9 


0.0 


0.0 


Breast ca.* (pi. 

effusion) 

T47D 


0.0 


93359_NCI-H292_IL-13 


0.0 


0.0 


oreasi ca. 
BT-549 


fi fi 

u.u 


ztjjD /_in L/i-xizyz_ir in 
gamma 


u.u 


n n 

v.v 


oreasi ca. 
MDA-N 


n fi 








Ovary 


0.0 


93778_HPAEC_EL-1 
beta/TNA alpha 


0.0 


0.0 


Ovarian ca. 
OVCAR-3 


0.0 


93254_Normal Human 

r t^M_ i 1 x 

Lung Fibroblastjione 


3.4 


0.0 


Ovarian ca, 
OVCAR-4 


0.0 


93253_Normal Human 

y T"*"1_ 11 x rnvTT? / /I 

Lung FibroblastJTNFa (4 
ng/ml) and IL-lb (1 ng/ml) 


0.0 


0.0 


uvanan ca. 
OVCAR-5 


fi fi 
U.U 


yjzj /_JNormai Human 
Lung FibroblasML-4 


fi fi 

U.U 




Ovarian ca. 
OVCAR-8 


0.0 


93256_Normal Human 

T 1—1**1 11 A IT /\ 

Lung Fibroblast_IL-9 


0.0 


0.0 


Ovarian ca. 
IGROV-1 


0.0 


93255JSformal Human 
Lung Fibroblast JL-13 


0.0 


0.0 


Ovarian ca.* 
(ascites) SK- 
OV-3 


0.0 


93258_Normal Human 
Lung FibroblastJFN 
gamma 


0.0 


0.0 


Uterus 


0.5 


93l06_Dermal Fibroblasts 
CCD1070_resting 


0.0 


0.0 


Plancenta 


0.0 


93361 Dermal Fibroblasts 

CCD1070JTNFalpha4 

ng/ml 


0.0 


0.0 
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Prostate 


0.0 


93105_Dermal Fibroblasts 
CCD1070_IL-1 beta 1 
ng/ml 


0.0 


4.3 


Prostate ca.* 
(bone met)PC-3 


n ft 


fibroblastJFN gamma 




7 0 


Testis 


0.4 


93771_dermal 
nbroblast_IL-4 


0.0 


0.0 


Melanoma 
Hs688(A).T 


0.0 


93259JBD Colitis 1** 


6.0 


13.1 


Melanoma* 
(met) 

Hs688(B).T 


0.0 


93260_IBD Colitis 2 


2.8 


5.2 


Melanoma 
UACC-62 


V.S 


yozoi_iDjJ Oronns 


U.U 


1 o 

i.y 


Melanoma 
M14 


0.1 


73501 0_Colon_normal 


1.3 


3.2 


Melanoma 
LOX 1MV1 


0.0 


735019_Lung_none 


14.9 


0.0 


Melanoma* 
(met) SK- 
MEL-5 


0.4 


64028-l_Thymus_none 


2.3 


7.3 


Adipose 


0.0 


6403 0- 1_ Kidney_none 


0.0 


0.0 



Both probe/primer sets are specific for the sequence of gene Acc. No. MOL3. Unigene 
data at http://ww.ncbi.r^ 

and our RTQ-PCR panels 1 .3D,2D and 4D indicate that this gene is specifically 
5 expressed by the liver and upregulated in some hepatocellular carcinomas (HCCs).There is 

evidence suggesting that the examination of the serum complements may be a useful tool for the 
detection of HCCs in liver cirrosis (LC) patients. (Takezaki E, Murakami S, Nishibayashi H, 
Kagawa K, Ohmori H. GanNo Rinsho 1990 Oct;36(12):21 19-22). Therefore the serum level of 
this protein can be used as a diagnostic marker to detect LC and HCC and antibodies directed 
1 0 against this protein can be a potential therapeutic agent against LC and HCC; In addition, this 
molecule can also serve as a specific marker for differentiating liver from other tissues. 



MOL4 

15 Expression of gene MOL4 was assessed using the primer-probe set Agl 61 1 , described in 

Table 21 
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Table 21. Probe Name: Agl611 



Primers 


Sequences 


Tm 


Lengt 
h 


Start 
Position 


SEQID 

NO: 


Forward 


5'-ATATGCTGTGCTGCATTCAGT-3' 


58.4 


21 


40 


85 


Probe 


TET-5'- 

CTGCCTGGTCAGTGAACAATTTCCTG- 
3'-TAMRA 


68.9 


26 


65 


86 


Reverse 


5'-CAAGGCCACACTAGTCGTGTAG-3' 


59.9 


22 


117 


87 | 



Expression of this gene in panels 1 .3D, 4D and 2 is at veiy low to undetectable levels (Ct 
5 values>35) in a number of tissues. 

MOL6 

Expression of gene MOL6a was assessed using the primer-probe set Agl 167, described 
in Table 22. Results of the RTQ-PCR runs are shown in Tables 23 and 24. 
iO Table 22. Probe Name: Agll67 



Primers 


Sequences 


Tm 


Lengt 
h 


Start 
Position 


SEQID 
NO: 


Forward 


S'-TCTTTGCTGACTCATCTGTTCA-S* 


58.7 


22 


51 


88 


Probe 


TET-5- 

AAGAAGACCCTGCTCCCTATTTGGTG- 
3'-TAMRA 


67.5 


26 


75 


89 


Reverse 


5'-AGGGGTTGAAGTGAGACTTGAG-3' 


59.8 


22 


104 


90 



Table 23. Panels 1.3D and 4D 



PANEL 1.3D 




PANEL 4D 




Tissue Name 


Rel. Expr., 
% 

1.3dx4tm55 
86t agl 167 
al 


Tissue Name 


Rel. Expr., 
% 

4Dtml937t 
_agll67 


Liver 

adenocarcinoma 


1.1 


93768 Secondary Thl anti-CD28/anti- 
CD3 


1.7 


Pancreas 


0.0 


93769 Secondary Th2 anti-CD28/anti- 
CD3 


0.0 


Pancreatic ca. 
CAP AN 2 


1.0 


93770 Secondary Trl anti-CD28/anti- 
CD3 


0.9 


Adrenal gland 


0.0 


93573_Secondary Thljresting day 4-6 
inIL-2 


2.7 
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Thyroid 


0.0 


93572_Secondary Th2_resting day 4-6 
inIL-2 


0.0 


Salivary gland 


0.0 


93571_Secondary Trl_resting day 4-6 
in H-2 


0.0 


Pituitary gland 


2.1 


93568 primary Thl anti-CD28/anti- 
CD3 ~ J 


0.0 


Brain (fetal) 


4.1 


93569 ^primary Th2 anti-CD28/anti- 
CD3 


3.0 


Brain (whole) 


1.1 


93570_primaryTrl anti-CD28/anti- 
CD3 


2.4 


Brain (amygdala) 


4.1 


93565_primary Thl resting dy 4-6 in 
IL-2 


5.3 


Brain (cerebellum) 


6.7 


93566_primary Th2 resting dy 4-6 in 
IL-2 


3.7 


Brain (hippocampus) 


4.3 


93567jprimary Trl resting dy 4-6 in 
EL-2 


3.7 


Brain (substantia 
nigra) 


5.4 


,93351_CD45RA CD4 
lymphocyte jmti-CD28/anti-CD3 


1.6 


Brain (thalamus) 


4.7 


93352_CD45RO CD4 
lymphocyte_anti-CD28/anti-CD3 


2.6 


uereDrai cortex 


1 1 


yoZM_L/JJo Lympnocytes_aiiti- 
CD28/anti-CD3 


A Q 1 

u.y | 


Spinal cord 


1 £ 
1.0 


yo3Di_cnronic cjjo Lympnocytes 
2ry_resting dy 4-6 in EL-2 




CNS ca folio/astro^ 
U87-MG 


0.0 


93 S74 chronic CDS T vmnlioovte^ 
2ry_activated CD3/CD28 


0 9 


CNSca. (glio/astio) 
U-118-MG 


0.0 


93354_CD4_none 


1.0 


CNS ca. (astro) 
SW1783 


0.0 


93252 Secondary Thl/Th2/Trl anti- 
CD95 CH11 


2.5 


CNS ca.* (neuro; met 
) SK-N-AS 


0.0 


93 103 JLAK cells j-esting 


7.0 


CNS ca. (astro) 
SF-539 


0.0 


93788JLAKcells__IL-2 


0.9 


CNS ca. (astro) 
SNB-75 


2.4 


93787_LAK cells JL-2+IL-12 


1.6 


CNS ca. (glio) 
SNB-19 


4.3 


93789_LAK cells JL-2+IFN gamma 


5.8 


CNS ca. (glio) 
U251 


1.3 


93790JLAK cells JL-2+ IL-18 


1.7 


CNS ca. (glio) 
SF-295 


0.0 


93104JLAK cells JPMA/ionomycin 
and IL-18 


0.0 


Heart (fetal) 


0.0 


93578JMK Cells IL-2_resting 


2.2 


Heart 


0.0 


93 109_Mixed Lymphocyte 
Reaction_Two Way MLR 


2.4 


Fetal Skeletal 


0.0 


93 1 1 OJVlixed Lymphocyte 
Reaction_Two Way MLR 


1.6 
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Skeletal muscle 


1.0 


93 1 1 IMixed Lymphocyte 
Reaction Two Way MLR 


0.0 


Bone marrow 


0.0 


93 1 ^Mononuclear Cells 
(PBMCs) resting 


0.0 


Thymus 


0.0 


93113 Mononuclear Cells 
(PBMCs) PWM 


6.0 


Spleen 


16.0 


93 1 ^Mononuclear Cells 
(PBMCs) PHA-L 


0.9 


Lymph node 


1.4 


93249_Ramos (B cell)_none 


6.8 


Colorectal 


0.0 


93250_Ramos (B cell)_jonomycin 


5.9 


Stomach 


0.0 


93349 JB lymphocytes JPWM 


3.8 


Small intestine 


2.9 


93350_B rymphoytes_CD40L and IL-4 


2.4 


Colon ca. 
SW480 


0.0 


92665 JBOL-1 (Eosinophil) jibcAMP 
differentiated 


82.9 


Colon ca.*(SW480 
met)SW620 


1.7 


93248 JBOL-1 

(Eosinophil)_dbcAMP/PMAionomycin 


100.0 


Colon ca. 
HT29 


0.0 


93356_Dendritic Cells jaone 


2.5 


Colon ca. 
HCT-116 


3.9 


93355JDendritic Cells_LPS 100 ng/ml 


0.9 


Colon ca. 
CaCo-2 


1.0 


93775_Dendritic Cells_anti-CD40 


2.5 


83219 CCWell to 
ModDiff(OD03866) 


0.0 


93774_Monocytes_resting 


0.0 


Colon ca. 
HCC-2998 


1.9 


93776 JMonocytesJJPS 50 ng/ml 


0.8 


Gastric ca.* (liver 
met) NCI-N87 


2.3 


9358 l_Macrophages_resting 


0.8 


Bladder 


4.9 


93582_Macrophages_LPS 100 ng/ml 


0.4 


Trachea 


0.0 


93098JSUVEC (Endothelial)_none 


0.9 


Kidney 


1.0 




i < 


Kidney (fetal) 


2.3 


93100_HUVEC (Endothelial)_IL-lb 


0.0 


Renal ca. 
786-0 


0.0 


93779_HUVEC (Endothelial)_IFN 
gamma 


2.4 


Renal ca. 
A498 


0.9 


93102_HUVEC (Endothelial)_TNF 
alpha + IFN gamma 


0.0 


Renal ca. 
RXF393 


0.0 


93101_HUVEC (Endomelial)_TNF 
alpha + IL4 


0.0 


Renal ca. 
ACHN 


1.5 


93781_HUVEC (EndotheliaI)_IL-l 1 


0.9 
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Renal ca. 
UO-31 


0.0 


93583_Lung Microvascular Endothelial 
Cells_none 


2.1 


Renal ca. 
TK-10 


4.0 


93584 Lung Microvascular Endothelial 
CellsJTNFa (4 ng/ml) and ILlb (1 
ng/ml) 


1.3 


Liver 


0.0 


92662 ^Microvascular Dermal 
endothelium none 


1.2 


Liver (fetal) 


0.0 


92663_Microsvasular Dermal 
endothelium TNFa (4 ng/ml) and ILlb 
(1 ng/ml) 


1.5 


Liver ca. 

(hepatoblast) HepG2 


0.5 


93773_Bronchial epithelium_TNFa (4 
ng/ml) and ILlb (1 ng/ml) ** 


44.1 


Lung 


0.0 


93347_Small Airway Epithelium_none 


5.6 


Lung (fetal) 


2.2 


93348_Small Airway 

Epithelium TNFa (4 ng/ml) and ILlb 

(1 ng/ml) 


25.7 


Lung ca. (small cell) 
LX-1 


5.2 ; 


92668_Coronery Artery SMC_resting 


0.7 


Lung ca. (small cell) 

MPT WfiO 


2.8 


92669_Coronery Artery SMCJTNFa (4 
ng/mij ana ii^iu yi ng/nuj 


0.0 


Lung ca. (s.cell var.) 
SHP-77 


0.8 


93 1 07_astrocytes_resting 


3.5 


Lung ca. (large 


2.0 


93 108 jistrocytesJTNFa (4 ng/ml) and 

TT 1h (\ r\olrr\\\ 


1.4 


Lung ca. (non-sm. 
cell)A549 


1.8 


92666_KU-8 1 2 (Basophil)_resting 


20.4 


Lung ca. (non-s.cell) 
NCI-H23 


4.7 


92667_KU-812 
(Basophil)_PMA/ionoycin 


34.2 


Lung ca (non-s.cell) 
HOP-62 


1.3 


93579_CCD1 106 (Keratinocytes)_none 


0.7 


Lung ca. (non-s.cl) 
NCI-H522 


0.0 


93580_CCD1106 

(Keratinocytes)_TNFa and IFNg ** 


19.1 


Lung ca. (squam.) 
SW 900 


0.0 


93791_Liver Ciirhosis 


3.2 


Lung ca. (squam.) 
NCI-H596 


2.2 


93792_Lupus Kidney 


5.2 


Mammary gland 


0.8 


93577 NCI-H292 


6.3 


Breast ca.* (pi. 
effusion) MCF-7 


3.8 


93358_NCI-H292_IL-4 


7.4 


Breast ca.* (pl.ef) 
MDA-MB-231 


0.0 


93360_NCI-H292_IL-9 


6.5 


Breast ca.* (pi. 
effusion) T47D 


0.0 


93359_NCI-H292_IL-13 


8.8 


Breast ca. 
BT-549 


0.0 


93357_NCI-H292_IFN gamma 


4.2 
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Breast ca. 

MDA-N j 


0.0 


93777_HPAEC_- 


0.9 


Ovary 


0.0 


93778_HPAEC_IL-1 beta/TNA alpha 


1.4 


Ovarian ca. 
OVCAR-3 


0.0 


93254_Normal Human Lung 
Fibroblast none 


0.0 


Ovarian ca. 
OVCAR-4 


1.6 


93253_Normal Human Lung 
Fibroblast TNFa (4 ng/ml) and IL-lb 
(1 ng/ml) 


0.0 


Ovarian ca. 
OVCAR-5 


1.9 


93257_Normal Human Lung 
Fibroblast DL-4 


1.5 


Ovarian ca. 
OVCAR-8 


2.8 


93256 Normal Human Lung 
Fibroblast IL-9 


0.0 


Ovarian ca. 

TPtPHV 1 
IVJAAJ V - 1 


2.7 


93255_Normal Human Lung 
iuroDiasi__LL- 1 j 


1.9 


Ovarian ca.* (ascites) 

aJv-U V -J 


5.6 


93258_Nonnal Human Lung 
x 4 JDroDiasi_ir.i\ gamma 


2.2 


Uterus 


0.6 


93106_Dermal Fibroblasts 
CCD1070_resting 


0.8 


Plancenta 


1.0 


93361_Dermal Fibroblasts 
CCD1070_TNF alpha 4 ng/ml 


1.6 


Prostate 


0.0 


93105 Dermal Fibroblasts 
CCD1070JL-1 beta 1 ng/ml 


0.0 


Prostate ca.* (bone 
met)PC-3 


0.0 


93772_dermal fibroblast_IFN gamma 


2.0 


Testis 


100.0 


93771_dermal fibroblastJL-4 


0.9 


Melanoma 
Hs688(A).T 


0.0 


93259_ffiD Colitis 1** 


21.0 


Melanoma* (met) 
Hs688(B).T 


0.0 


93260_IBD Colitis 2 


1.3 


Melanoma 
UACC-62 


0.0 


93261JBD Crohns 


0.7 


Melanoma 
M14 


0.9 


735010_Colon_normal 


0.7 


Melanoma 
LOXIMVI 


0.0 


735019_Lung_none 


2.2 


Melanoma* (met) 
SK-MEL-5 


0.0 


64028-l_Thymus_none 


41.8 


Adipose 


1.9 


64030-l_Kidney_none 


6.0 



Table 24. Panels 2D and 3D 



PANEL 2D 




PANEL 3D 




Tissue Name 


Rel. Expr., 
% 


Tissue Name 


Rel. Expr., 
% 
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2Dtm2324t 
agll67 




3dtm5309t 
_agll67 


Normal Colon 
GENPAK 061003 


2.9 


94905_Daoy_Medulloblastoma/Cerebell 
um sscDNA 


4.0 


83219 CC Well to 
ModDiff(OD03866) 


2.5 


94906_TE67 l_Medulloblastom/Cerebel 
lum sscDNA 


0.0 


83220 CC NAT 
(OD03866) 


18.9 


94907_D283 

Med Medulloblastoma/Cerebelhim ssc 
DNA 


8.4 


83221 CC Gr.2 

rectosigmoid 

(OD03868) 


2.2 


94908_PFSK-l_Primitive 
Neuroectodermal/Cerebellum_sscDNA 


7.0 


83222 CCNAT 
(OD03868) 


0.0 


94909_XF-498_CNS_sscDNA 


0.0 


83235 CC Mod Diff 
(ODO3920) 


6.3 


94910_SNB-78_CNS/glioma_sscDNA 


10.2 


83236 CCNAT 
(ODO3920) 


12.6 


9491 1_SF- 

268_CNS/glioblastoma_sscDNA 


0.0 


83237 CC Gr.2 
ascend colon 
(OD03921) 


0.0 


94912_T98G_Glioblastoma_sscDNA 


0.0 . 


83238 CC NAT 
(OD03921) 


7.7 


96776_SK-N-SH_Neuroblastoma 
(metastasis)_sscDNA 


0.0 


83241 CC from 
Partial Hepatectomy 
(ODO4309) 


2.9 


94913_SF- 

295_CNS/glioblastoma_sscDNA 


0.0 


83242 Liver NAT 
(ODO4309) 


0.0 


949 14_Cerebellum_sscDNA 


66.0 


87472 Colon mets to 
lung (OD0445 1-01) 


5.0 


96777_Cerebellum_sscDNA 


4.4 


87473 Lung NAT 
(OD04451-02) 


9.9 


94916_NCI-H292_Mucoepidermoid 
lung carcinomasscDNA 


7.3 


Normal Prostate 
Clontech A+ 6546- 
1 


17.4 


94917_DMS-1 14_Small cell lung 
cancer_sscDNA 


0.0 


84140 Prostate 
Cancer (OD04410) 


2.7 


94918_DMS-79_Small cell lung 
cancer/neuroendocrine sscDNA 


100.0 


84141 Prostate NAT 
(OD04410) 


262 


94919_NCI-H146_Small cell lung 
cancer/neuroendocrine sscDNA 


16.4 


87073 Prostate 
Cancer (OD04720- 
01) 


20.6 


94920_NCI-H526_Small cell lung 
cancer/neuroendocrine_sscDNA 


2.1 


87074 Prostate NAT 
(OD04720-02) 


13.9 


94921_NCI-N417_Small cell lung 
cancer/neuroendocrine sscDNA 


0.0 


Normal Lung 
GENPAK 061010 


17.7 


94923_NCI-H82_Small ceU lung 
cancer/neuroendocrine sscDNA 


3.5 


83239 Lung Met to 
Muscle (OD04286) 


5.0 


94924_NCI-H157_Squamous cell lung 
cancer (metastasis)sscDNA 


0.0 


83240 Muscle NAT 


2.6 


94925_NCI-H1 1 55_Large cell lung 

— ~~~-f~~„ — A~~~ #.r.«T\\T A 


0.0 
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(OD04286) 




/ 1 " T*VXT A. 

cancer/neuroendocnne_sscDNA 




84136 Lung 
Malignant Cancer 
(OD03126) 


4.9 


94926 JTCl-H1299_Large cell lung 
cancer/neuroendocrine_sscDNA 


11.3 


84137 Lung NAT 
(OD03126) 


0.0 


94927JsO-H727_Lung 
carcinoid sscDNA 


3.0 


84871 Lung Cancer 
(OD04404) 


5.5 


94928_NCI-UMC-1 l_Lung | 
carcinoid sscDNA 


10.1 


84872 Lung NAT 
(OD04404) 


2.4 


94929_LX-l_Small cell lung 
cancer sscDNA 


0.0 


84875 Lung Cancer 
(OD04565) 


2.5 


94930_Colo-205_Colon 
cancer sscDNA 


0.0 


84876 Lung NAT 
(OD04565) 


7.7 


94931_KM12_Colon cancer_sscDNA 


4.9 


85950 Lung Cancer 
(OD04237-01) 


18.4 


94932 JCM20L2_Colon 
cancer_sscDNA 


0.0 


85970 Lung NAT 
(OD04237-02) 


7.8 


94933_NCI-H716_Colon 
cancer sscDNA 


0.0 


83255 Ocular Mel 
Met to Liver 
(ODO4310) 


0.0 


94935J5W-48_Colon 
adenocarcinoma_sscDNA 


0.0 


83256 Liver NAT 
(ODO4310) 


0.0 


94936_SW1 1 16_Colon 
adenocarcinoma sscDNA 


1.6 


84139 Melanoma 
Mets to Lung 
(OD04321) 


3.0 


94937_LS 174T_Colon 
adenocarcinomajsscDNA 


0.0 


84138 Lung NAT 
(OD04321) 


2.6 


9493 8 JSW-948_Colon 
adenocarcinoma sscDNA 


4.4 


Normal Kidney 
GENPAK 061008 


100.0 


94939JSW-480_Colon 
adenocarcinoma sscDNA 


0.0 


83786 Kidney Ca, 
Nuclear grade 2 
(OD04338) 


56.6 


94940_NCI-SNU-5_Gastric 
carcinoma_sscDNA 


0.0 


83787 Kidney NAT 
(OD04338) 


23.2 


94941_KATO Hljaastric 
carcinoma sscDNA 


2.6 


83788 Kidney Ca 
Nuclear grade 1/2 
(OD04339) 


0.0 


94943_NCI-SNU-16_Gastric 
carcinoma sscDNA 


3.6 


83789 Kidney NAT 
(OD04339) 


46.0 


94944 _NCI-SNU-lJjastric 
carcinoma sscDNA 


0.0 


83790 Kidney Ca, 
Clear cell type 
(OD04340) 


12.9 


94946_RF-lJ3astric 
adenocarcinoma__sscDNA 


0.0 


83791 Kidney NAT 
(OD04340) 


24.5 


94947 JRF-48_Gastric 
adenocarcinoma sscDNA 


0.0 


83792 Kidney Ca, 
Nuclear grade 3 
(OD04348) 


4.5 


96778 - MKN-45_Gastric 
carcinoma_sscDNA 


0.0 
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83793 Kidney NAT 
(OD04348) 


44.4 


94949_NCI-N87j3astric 
carcinoma sscDNA 


0.0 


87474 Kidney Cancer 
(OD04622-01) 


2.8 


9495 l_OVC AR-5_Ovarian 
carcinoma sscDNA 


0.0 


87475 Kidney NAT 
(OD04622-03) 


0.0 


94952 JlL95-2JJterine 
carcinoma sscDNA 


0.0 


85973 Kidney Cancer 
(OD04450-01) 


4.4 


94953_HelaS3_Cervical 
adenocarcinoma sscDNA 


0.0 


85974 Kidney NAT 
(OD04450-03) 


24.8 


94954_Ca Ski_Cervical epidermoid 
carcinoma (metastasis)_sscDNA 


5.8 


Kidney Cancer 
Clontech 8120607 


0.0 


94955_ES-2_Ovarian clear cell 
carcinoma sscDNA 


0.0 


Kidney NAT 
Clontech 8120608 


0.0 


94957 _Ramos/6h stimj 1 ; Stimulated 
with PMA/ionomycin 6h sscDNA 


0.0 


Kidney Cancer 
Clontech 8120613 


0.0 


94958_Ramos/14h stunj'; Stimulated 
with PMA/ionomycin 14h_sscDNA 


0.0 


Kidney NAT 
Clontech 8120614 


0.0 


94962_MEG-01_Chronic myelogenous 
leukemia (megokaryoblast)_sscDNA 


28.3 


Kidney Cancer 
Clontech 9010320 


2.6 


94963 JlajiJBiirknfs 
lymphoma_sscDNA 


0.0 


Kidney NAT 
Clontech 9010321 


8.4 


94964_Daudi_Burkitt , s 
lymphoma sscDNA 


2.9 


Normal Uterus 
GENPAK061018 


0.0 


94965_U266_B-cell 
plasmacytoma/myelomasscDNA 


6.4 


Uterus Cancer 
GENPAK 064011 


16.2 


94968_CA46_Burkitt J s 
lymphoma_sscDNA 


0.0 


Normal Thyroid 
Clontech A+ 6570- 
1 


1.8 


94970JRL_non-Hodgkin , s B-cell 
lymphomasscDNA 


0.0 


Thyroid Cancer 
GENPAK 064010 


27.2 


94972_JMl_pre-B-cell 
lymphoma/leukemiasscDNA 


6.7 


Thyroid Cancer 
INVTTROGEN 
A302152 


5.2 


94973_Jurkat_T cell leukemia_sscDNA 


4.2 


Thyroid NAT 

INVTTROGEN 

A302153 


19.9 


94974_TF-l_Erythroleukemia_sscDNA 


2.4 


Normal Breast 
GENPAK 061019 


47.3 


94975 JIUT78_T-cell 
lymphoma_sscDNA 


2.8 


84877 Breast Cancer 
(OD04566) 


7.0 


94977_U937_Histiocytic 
lymphoma_sscDNA 


6.6 


85975 Breast Cancer 
(OD04590-01) 


15.7 


94980JKU-8 1 2_Myelogenous 
leukemia sscDNA 


18.2 


Rrea«;t Cancer 

Mets (OD04590-03) 


17 7 

±1.1 


949R1 769-P CleaT cell renal 
carcinoma sscDNA 


2.6 


87070 Breast Cancer 

Metastasis 

(OD04655-05) 


76.8 


94983_Caki-2_Clear cell renal 
carcinoma_sscDNA 


0.0 


GENPAK Breast 


42.6 


94984_SW 839_Clear cell renal 


1.9 



179 



WO 01/81578 



PCT/US01/13578 



Cancer 064006 




carcinoma_sscDNA 




Breast Cancer Res. 
Gen. 1024 


47.6 


94986_G401_Wilms l tumor_sscDNA 


0.0 


Breast Cancer 
Clontech 9100266 


2.6 


94987_Hs766T_Pancreatic carcinoma 
(LN metastasis) sscDNA 


0.0 


Breast NAT Clontech 
9100265 


5.1 


94988_CAPAN-l_Pancreatic 
adenocarcinoma (liver 
metastasis) sscDNA 


0.0 


Breast Cancer 

INVTTROGEN 

A209073 


10.7 


94989__SU86.86_Pancreatic carcinoma 
(liver metastasis)_sscDNA 


3.2 


Breast NAT 

INVTTROGEN 

A2090734 


12.9 


94990_BxPC-3_Pancreatic 
adenocaicinoma_sscDNA 


0.0 


Normal Liver 
GENPAK 061009 


13.1 


94991_HPAC_Pancreatic 
adenocarcinoma sscDNA 


3.6 


Liver Cancer 
GENPAK 064003 


4.8 


94992_MIA PaCa-2 _Pancreatic 
carcinoma sscDNA 


0.0 


Liver Cancer 
Research Genetics 
RNA 1025 


2.5 


94993_CFPAC-l_Pancreatic ductal 
adenocarcinoma_sscDNA 


13.6 


Liver Cancer 
Research Genetics 
RNA 1026 


0.0 


94994__PANC-l_Pancreatic epithelioid 
ductal carcinoma_sscDNA 


0.0 


Paired Liver Cancer 
Tissue Research 
Genetics RNA 6004- 
T 


5.4 


94996_T24_Bladder carcinma 
(transitional cell)_sscDNA 


3.1 


Paired Liver Tissue 
Research Genetics 
RNA 6004-N 


2.5 


94997_5637_Bladder 
carcinoma_sscDNA 


0.0 


Paired Liver Cancer 
Tissue Research 
Genetics RNA 6005- 
T 


0.0 


94998 JHT-1197_Bladder 
carcinoma_sscDNA 


20.2 


Paired Liver Tissue 
Research Genetics 
RNA 6005-N 


0.0 


94999__UM-UC-3_Bladder carcinoma 
(transitional cell)_sscDNA 


0.0 


Normal Bladder 
GENPAK 061001 


3.1 


95000 A204 Rhabdomyosarcoma ssc 
DNA 


3.3 


Bladder Cancer 
Research Genetics 
RNA 1023 


0.0 


95001JIT- 

1 080_Fibrosarcoma_sscDNA 


5.2 


Bladder Cancer 
INVTIROGEN 
A302173 


16.7 


95002_MG-63_Osteosarcoma 
(bone)_sscDNA 


0.0 


87071 Bladder 
Cancer (OD04718- 
01) 


0.0 


95003JSK-LMS-lJLriomyosarcoma 
(vuIva)_sscDNA 


7.9 



180 



WO 01/81578 



PCT/US01/13578 



87072 Bladder 
Normal Adjacent 
(OD04718-03) 


2.9 


95004_SJRH30_Rhabdomyosarcoma 
(met to bone marrow)_sscDNA 


0.0 


Normal Ovaiy Res. 
Gen. 


0.0 


95005_A43 1 ^Epidermoid 
carcinoma sscDNA 


0.0 


Ovarian Cancer 
GENPAK 064008 


12.6 


95007_WM266-4_Melanoma_sscDNA 


0.0 


87492 Ovary Cancer 
(OD04768-07) 


2.9 


95010_DU 145_Prostate carcinoma 
(brain metastasis)_sscDNA 


0.0 


87493 Ovary NAT 
(OD04768-08) 


0.0 


95012JMDA-MB-468_Breast 
adenocarcinoma sscDNA 


0.0 


Normal Stomach 
GENPAK 061017 


5.9 


95013 JSCC-4_Squamous cell 
carcinoma of tonguesscDNA 


0.0 


Gastric Cancer 
Clontech 9060358 


0.0 


95014_SCC-9_Squamous cell 
carcinoma of tongue sscDNA 


0.0 


NAT Stomach 
Clontech 9060359 


0.0 


95015_SCC-15_Squamous cell 
carcinoma of tongue sscDNA 


0.0 


Gastric Cancer , 
Clontech 9060395 


2.9 


95017_CAL 27_Squamous cell 
carcinoma of tongue_sscDNA 


0.0 


NAT Stomach 
Clontech 9060394 


0.0 






Gastric Cancer 
Clontech 9060397 


0.0 






NAT Stomach 
Clontech 9060396 


0.0 






Gastric Cancer 
GENPAK 064005 


10.5 







Expression of gene MOL6 in panel 1.3D is detected in the testes and at very low levels in 
the spleen, but not in any other tissues. Expression in panel 2D indicates higher expression in 
5 normal kidney and markedly lower expression in kidney cancer. This indicates a potential role 
for this gene as a protein therapeutic in cases of kidney cancer. In panel 3D, expression is seen to 
be highest in a specimen of small cell lung cancer, chrome myelogenous leukemia and bladder 
cancer. 

Expression of gene MOL6 in Panel 4D is up-regulated in keratinocytes and small airway 
10 epithelium after treatment with TNF alpha and IL-1 beta. It is also upregulated in eosinophils 
regardless of treatment and in normal thymus. 

Potential Role(s) of MOL6 in Inflammation: The expression pattern of GM_87760758_A 
shows that it is induced in keratinocytes and small airway epithelium in response to pro- 
inflammatoiy cytokines. Thus the protein in question may contribute to tissue destruction in the 
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airways, recruitment of leukocytes, and tissue remodeling (Reichart et al., 1 996 J. Pathol. 1 78 
(2): 215-20). 

Impact of Therapeutic Targeting of MOL6: Antibodies or small molecule therapeutics to 
MOL6 may Teduce or inhibit tissue damage due to inflammation in psoriasis, asthma and other 
5 mast cell-mediated diseases both in the skin and in the airways. The results are also suggestive 
of a potential role for MOL6 in the treatment for emphysema (Rice et al., 1998 Curr Pharm Des 
4(5): 381-96). 

MOL7 

10 Expression of gene MOL7 was assessed using the primer-probe sets Agl 876, described 

in Table 25. Results of the RTQ-PCRruns are shown in Table 26. 



Table 25. Probe name: Agl876 



Primers 


Sequences 


Tm 


Lengt 
h 


Start 

Positio 

n 


SEQ 
ID NO: 


Forward 


5'-AGCAAGATTGCTCACACAGAGT-3' 


59.2 


22 


668 


91 


Probe 


TET-5'- 

CCAGTCAATACCATCATCATCCATGAGG- 
3-TAMRA 


69.1 


28 


692 


92 


Reverse 


5-.TATGTTGTTGCTCATGGAGTTG-3' 


58.7 


22 


730 


93 



15 

Table 26. Panel 1.3D 



Tissue Name 


Rel. Expr., % 
1.3dx4tm5422t 
agl876_al 


Tissue Name 


Rel. Expr., % 
1.3dx4tm5422t 
_agl876_al 


Liver 

adenocarcinoma 


0.0 


Kidney (fetal) 


0.0 


Pancreas 


0.0 


Renal ca. 


786-0 


0.0 


Pancreatic ca. 
CAP AN 2 


0.4 


Renal ca. 


A498 


0.5 


Adrenal gland 


0.7 


Renal ca. 


RXF393 


0.0 


Thyroid 


. 0.0 


Renal ca. 


ACHN 


0.6 


Salivary gland 


0.4 


Renal ca. 


UO-31 


0.0 


Pituitary gland 


0.0 


Renal ca. 


TK-10 


0.4 


Brain (fetal) 


03 


Liver 


0.0 
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Brain (whole) 


4.2 


Liver (fetal) 


A ft 

u.o 


Brain (amygdala) 


2.5 


Liver ca. (hepatoblast) HepG2 


0.0 


Brain (cerebellum) 


0.8 


Lung 


0.0 


Brain (hippocampus) 


2.7 


Lung (fetal) 


0.0 


Brain (substantia 
nigra) 


1.4 


Lung ca. (small cell) LX-1 


1.3 


Brain (thalamus) 


1.6 


Lung ca. (small cell) NCI- 
H69 


0.0 


Cerebral Cortex 


0.6 


Lung ca. (s.cell var.) SHP-77 


0.0 


Spinal cord 


0.4 


Lung ca. (large cell)NCI-H460 


0.4 


CNS ca. (glio/astro) 
U87-MG 


0.0 


Lung ca. (non-sm. cell) A549 


0.5 


CNSca. (glio/astro) 
U-118-MG 


0.2 


Lung ca. (non-s.cell) NCI-H23 


0.0 


CNS ca. (astro) 
SW1783 


0.0 


Lung ca (non-s.cell) HOP-62 


0.0 


CNS ca.* (neuro; met 
) SK-N-AS 


0.7 


Lung ca. (non-s.cl) NCI-H522 


0.3 


CNS ca. (astro) 
SF-539 


0.0 


Lung ca. (squam.) SW 900 


.0.8 


CNS ca. (astro) 
SNB-75 


0.0 


Lung ca. (squam.) NCI-H596 


0.0 


CNS ca. (glio) 
SNB-19 


2.2 


Mammary gland 


0.0 


CNS ca. (glio) 
U251 


1.3 


Breast ca * (pi. effusion) 
MCF-7 


2.9 


CNS ca. (glio) 
SF-295 


0.4 


Breast ca* (pl.ef) MDA-MB- 
231 


0.0 


Heart (fetal) 


0.0 


Breast ca * (pi. effusion) 
T47D 


0.0 


Heart 


0.0 


Breast ca. BT- 
549 


0.6 


Fetal Skeletal 


0.0 


Breast ca. MDA- 
N 


0.0 


Skeletal muscle 


0.0 


Ovary 


0.0 


Bone marrow 


0.0 


Ovarian ca. OVCAR- 
3 


0.0 


Thymus 


1.0 


Ovarian ca. OVCAR- 
4 


0.0 


Spleen 


0.0 


Ovarian ca. OVCAR- 
5 


0.3 


Lymph node 


0.5 


Ovarian ca. OVCAR- 
8 


0.8 


Colorectal 


0.0 


Ovarian ca. IGROV- 
1 


0.0 
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Stomach 


0.4 


Ovarian ca.* (ascites) SK-OV- 
3 


0.6 


Small intestine 


0.7 


Uterus 


0.0 


Colon ca. 
SW480 


0.0 


Plancenta 


5.5 


Colon ca.*(SW480 
met)SW620 


0.6 


Prostate 


0.0 


Colon ca 
HT29 


0.0 


Prostate ca.* (bone met)PC-3 


0.0 


Colon ca. 
HCT-116 


0.3 


Testis 


100.0 


Colon ca. 
CaCo-2 


0.0 


Melanoma Hs688(A).T 


0.0 


83219 CC Well to 
Mod DifrYOD03 866^ 


0.0 


Melanoma* (met) Hs688(B).T 


0.0 


Colon ca. 
HCC-2998 


0.2 


Melanoma UACC-62 


1.4 


Gastric ca.* (liver 
met) NCI-N87 


0.0 


Melanoma M14 


0.0 


Bladder 


0.0 


Melanoma LOXIMVT 


0.0 


Trachea 


32.3 


Melanoma* (met) SK-MEL-5 


0.0 


Kidney 


0.0 


Adipose 


8.2 



Expression of MOL7 in panel 1.3D was highest in testis, followed by trachea. Expression 
in the brain is at much lower levels. This molecule may therefore have a role in male fertility. 
There was low to undetectable expression in the samples of panel 4D (CT values >35). 



MOL8b 

Expression of gene MOL8b was assessed using the primer-probe sets Ag3183, described 
10 in Table 27. Results of the RTQ-PCR runs are shown in Table 28. 



Table 27. Probe name: Ag3183 



Primers 


Sequences 


Tm 


Lengt 
h 


Start 
Position 


SEQID 

NO: 


Forward 


5'-AAGGGGACGAGTGTGGGATT-3' 


62 


20 


211 


94 


Probe 


TET-5'-TGGCACCGAAGTAGCCGTGGCG- 
3'-TAMRA 


74 


22 


301 


95 


Reverse 


5-GCGGGCACTTGGTGTCGCA-3' 


64 


19 


325 


96 
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Table 28. Panel 4D 



Tissue Name 


Rel. Expr., % 
4dx4tm4998t 
ag3183_a2 


tissue Name 


Rel. Expr., % 
4dx4tm4998t a 
g3183_a2 


93768__Secondary Thl_anti- 
CD28/anti-CD3 


0.0 


93100JIUVEC 
(Endothelial)JL-lb 


5.7 


93769_Secondary Th2_anti- 
CD28/anti-CD3 


0.0 


93779_HUVEC 

(Endothelial)_IFN 

gamma 


18.4 


93770_Secondary Trl_anti- 
CD28/anti-CD3 


0.0 


93102_HUVEC 
(EndothelialLTNF 
alpha + IFN gamma 


9.6 


93573_Secondary Thljresting 
day 4-6 in IL-2 


0.0 


93101_HUVEC 
(Endothelial)__TNF 
alpha + IL4 


11.2 


93572_Secondary Th2_resting 
day 4-6 in IL-2 


0.0 


93781_HUVEC 
(EndomeUal)JL-ll 


19.3 


93571__Secondary Trljresting 
day 4-6 in IL-2 


2.5 


93583_Lung 
Microvascular ! 
Endothelial Cells none 


21.6 


93568 primary Thl_anti- 
CD28/anti-CD3 


0.0 


93584_Lung 
Microvascular 
Endothelial 
Cells_TNFa (4 ng/ml) 
andILlb(l ng/ml) 


32.1 


93569 jrimary Th2_anti- 
CD28/antd-CD3 


0.0 


92662_Microvascular 
Dennal 

endothelium none 


10.7 


93570_primary Trl_anti- 
CD28/anti-CD3 


0.0 


92663_Microsvasular 
Dennal 

endotheliumJTNFa (4 
ng/ml) and ILlb (1 
ng/ml) 


5.0 


93565 ^primary Thl resting dy 
4-6 in IL-2 


0.0 


93773_Bronchial 
epimeh'umJTNFa (4 
ng/ml) and DLlb (1 
ng/ml) ** 


11.6 


93566_primaiy Th2_resting dy 
4-6 in IL-2 


2.3 


93347JSmall Airway 
Epithelium_none 


5.6 


93567_primaiy Trl resting dy 
4-6inEL-2 


0.7 


93348_Small Airway 
EpimeliumJTNFa (4 
ng/ml) and ILlb (1 
ng/ml) 


4.8 


93351_CD45RA CD4 
lymphocyte anti-CD28/anti- 
CD3 


29.9 


92668_Coronery 
Artery SMC_resting 


46.8 
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93352_CD45ROCD4 
lymphocyte anti-CD2S/anti- 
CD3 


0.0 


92669_Coronery 
Artery SMCJTNFa (4 
ng/ml) and ILlb (1 
ag/ml) 


45.1 


93251_CD8 

Lymphocytes anti-CD28/anti- 
CD3 


0.0 


93 1 07__astrocytes _resti 
ng 


82.4 


93353_chronic CD8 
Lymphocytes 2ry_resting dy 4- 
6inIL-2 


0.0 


93 1 08 jistrocytes_TNF 
a (4 ng/ml) and ILlb (1 
ng/ml) 


74.9 ! 


93574_chronic CDS 
Lymphocytes 2ry activated 
CD3/CD28 


0.0 


92666JOJ-812 
(Basophil)_resting 


0.0 


93354_CD4_none 


0.4 


92667J3J-812 
(Basophil) JPMA/ionoy 
cin 


0.0 


93252 Secondary 
Thl/Th2/Trl anti-CD95 CHI 1 


2.1 


93579_CCD1106 
Xeratinocytes)_none 


0.0 


93103_LAK cells_resting 


0.0 


93580_CCDll06 
(Keratinocytes)JTNFa 
and LrNg 


A A 

0.0 


93788JLAK cells JL-2 


0.0 


9379l_Liver Cirrhosis 


5.6 


93787JLAK cells_IL-2+IL-12 


0.0 


93792_Lupus Kidney 


9.3 


93789_LAK ceUsJL-2+IFN 
gamma 


0.0 


93577JNCI-H292 


2.5 


93790JLAK cells_IL-2+ IL-18 


0.0 


93358JSO-H292JL-4 


0.0 


931G4JLAK 

cells PMA/ionomycin and IL- 
18 


0.0 


93360_NCI-H292JL-9 


0.0 


93578JSIK Cells IL-2_resting 


0.0 


93359 NCI-H292 IL- 
13 


3.8 


93 109_Mixed Lymphocyte 
ReactionJTwo Way MLR 


0.0 


93357_NCI-H292_IFN 
gamma 


0.0 


9311 0_Mixed Lymphocyte 
Reaction_Two Way MLR 


0.0 


93777_HPAEC_- 


31.9 


93 1 1 l_Mixed Lymphocyte 
ReactionJTwo Way MLR 


0.0 


93778JIPAECJL-1 
beta/TNA alpha 


22.4 


93 1 12_Mononuclear Cells 
(PBMCs)_resting 


0.0 


93254 JsTormal Human 
Lung Fibroblastnone 


44.4 


93113 Mononuclear Cells 
(PBMCs)JPWM 


1.8 


93253_Nonnal Human 
Lung Fibroblast JTNFa 
(4ng/ml)andIL-lb(l 
ng/ml) 


28.3 


93114 Mononuclear Cells 
(PBMCs) PHA-L 


0.0 


93257_Normal Human 
Lung Fibroblast_IL-4 


62.6 i 


93249_Ramos (B cell)_none 


0.0 


93256_Normal Human 
Lung Fibroblast_IL-9 


100.0 
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93250_Ramos (B 
cell)_ionomycin 


0.0 


93255_Nonnal Human 
Lung Fibroblast_IL-13 


73.3 


93349JB lymphocytes_PWM 


0.0 


93258_Normal Human 
Lung FibroblastJFN 
gamma 


57.1 


93350_B lymphoytes_CD40L 
andIL-4 


0.0 


93106_Dennal 
Fibroblasts 
CCD 1070 resting 


91.5 


92665J20L-1 

(Eosinophil)_dbcAMP 

differentiated 


0.0 


93361_Dermal 
Fibroblasts 

CCD1070_TNFalpha4 
ng/ml 


31.1 


93248J50L-1 

(EosinopMl)_dbcAMP/PMAio 
nomycin 


1.2 


93105_Dermal 
Fibroblasts 

CCD1070_IL-1 beta 1 
ng/ml 


69.7 


93356_Dendritic Cells_none 


1.8 


93772_dermal 
ffhrnhla^t TFTtf pamma 

llUtUUJlual XX. 1- ~ {V* iXMXXXtt 


18.3 


93355_Dendritic CellsJLPS 


0.0 


93771_dermal 


22.4 


93775 Dendritic Cells anti- 
CD40 


0.4 


93259_IBD Colitis 1** 


32 


93 774_Monocytes_resting 


2.8 


93260JBD Colitis 2 


0.0 


93776 JvIonocytes_LPS 50 

np'/ml 

4.1X1 


0.0 


93261_IBD Crohns 


0.8 


93581_Macrophages_testing 


0.0 


7350 1 0_Colon_normal 


8.9 


93582_Macrophages_LPS 100 
ng/ml 


0.0 


735019_Lung_none 


18.5 i 


93098_HUVEC 
(Endothelial)_none 


8.4 


64028-l_Thymus_none 


9.4 


93099_HUVEC 
(Endothelial) starved 


29.9 


64030-l_Kidney_none 


3.0 



Expression of gene CG50889J02 in two runs of panel L3D was not reproducible and is not considered 
further. 

Expression of gene CG50889J02 in Panel 4D: There is 30-fold increase in the expression of 
5 CG50889_02 in activated naive T cells (CD4+ CD45RA celb) as compared to resting CD4 cells. This 
protein is expressed both in resting and activated fibroblasts, endothelium, and epithelium. 
Potential Role(s) of CG50889-02 in Inflammation 

MOL8b may be important in the initial activation of naive T cells. Activated T cells 
initiate the inflammatory process by secreting cytokines and chemokines, which in turn induce B 
10 cell antibody production leading to the extravasation of leukocytes into inflammatory sites. 
Impact of Therapeutic Targeting of MOL8b: 

Antibody or small molecule therapeutics to MOL8b may block T cell activation in 

response to tissue transplant and reduce or block rejection. These therapeutic drugs may also 

reduce or prevent inflammation in asthma/allergy, psoriasis, arthritis and diabetes in which 

187 



WO 01/81578 



PCT/US01/13578 



activated T ceDs play a pivotal role. Antibodies to MOL8b may also serve as a diagnostic or 
experimental tool to identify and differentiate naive activated T cells from more differentiated T 
cell population (memory T cells). 

MOL9a 

5 Expression of gene MOL9a was assessed using the primer-probe set Ag673, described in 

Table 29. Results of the RTQ-PCR runs are shown in Tables 30 and 3 1 . 

Table 29. Probe name: Ag673 



Primers 


Sequences 


Tm 


Length 


Start 
Position 


SEQID 
NO: 


Forward 


5'-TAGAGTTGGTGGTTCCAGGATT-3' 


59.9 


22 


6 


97 


Probe 


FAM-5'- 

TGATGTCTCCTCTTCAGGCAATGTCT- 
3'-TAMRA 


66.6 


26 


56 


98 


Reverse 


5'-TCTGCCAGCCACAGTATAGG-3' 


58.9 


20 


83 


99 



10 



Table 30. Panels 1.3D and 2D 



PANEL 1 JD 




PANEL 2D 




Tissue Name 


Rel. Expr., % 
1.3dx4tm5796f 
_ag673_bl 


Tissue Name 


Rel. Expr., % 
2Dtm2310f ag6 
73 


Liver adenocarcinoma 


23.4 


Normal Colon GENPAK 
061003 


39.2 


Pancreas 


0.9 


83219 CC Well to Mod Diff 
(OD03866) 


15.0 


Pancreatic ca. 
CAP AN 2 


8.4 


83220 CC NAT (OD03866) 


15.1 


Adrenal gland 


0.2 


83221 CC Gr.2 rectosigmoid 
(OD03868) 


13.1 


Thyroid 


0.9 


83222 CCNAT(OD03868) 


3.9 


Salivary gland 


0.2 


83235 CC Mod Diff 
(ODO3920) 


20.3 


Pituitary gland 


0.4 


83236 CC NAT (ODO3920) 


7.6 


Brain (fetal) 


8.2 


83237 CC Gr.2 ascend colon 
(OD03921) 


36.3 


Brain (whole) 


4.0 


83238 CC NAT (OD03921) 


4.8 


Brain (amygdala) 


1.5 


83241 CC from Partial 
Hepatectomy (ODO4309) 


30.6 


Brain (cerebellum) 


22 


83242 Liver NAT (ODO4309) 


20.3 
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Brain (hippocampus) 


2.9 


87472 Colon mets to lung 
(OD04451-01) 


41.5 


Brain (substantia nigra) 


1.6 


87473 Lung NAT (OD0445 1 - 
02) 


12.1 


Brain (thalamus) 


1.6 


Normal Prostate Clontech A+ 
6546-1 


25.5 


Cerebral Cortex 


5.3 


84140 Prostate Cancer 
(OD04410) 


22.5 


Spinal cord 


1.7 


84141 Prostate NAT 
(OD04410) 


23.2 


CNS ca. (glio/astro) 
U87-MG 


6.5 


87073 Prostate Cancer 
(OD04720-01) 


10.7 


CNSca. (glio/astro) 
U-118-MG 


8.5 


87074 Prostate NAT 
(OD04720-02) 


24.5 


CNS ca. (astro) 
SW1783 


21.6 


Normal Lung GENPAK 
061010 


27.0 


CNS ca.* (neuro; met ) 
SK-N-AS 


12.6 


83239 Lung Met to Muscle 
(OD04286) 


63.3 


CNS ca. (astro) 
SF-539 


3.9 


83240 Muscle NAT 
(OD04286) 


50.7 


CNS ca. (astro) 
SNB-75 


18.7 


84136 Lung Malignant Cancer 
(OD03126) 


42.3 


CNS ca. (glio) 
SNB-19 


4.3 


84137 Lung NAT (OD03126) 


29.1 


CNS ca. (glio) 
U251 


17.7 


84871 Lung Cancer 

f/~\T\ A A A A A \ 

(OD04404) 


69.3 


CNS ca. (glio) 
SF-295 


12.6 


84872 Lung NAT (OD04404) 


• 26.8 


Heart (fetal) 


0.2 


84875 Lung Cancer 
(OD04565) 


19.1 


Heart 


0.6 


84876 Lung NAT (OD04565) 


12.2 


Fetal Skeletal 


0.8 


85950 Lung Cancer 
(OD04237-01) 


94.0 


Skeletal muscle 


6.5 


85970 Lung NAT (OD04237- 
02) 


20.3 


Bone marrow 


0.7 


83255 Ocular Mel Met to 
Liver (ODO4310) 


74.2 


Thymus 


0.4 


83256 Liver NAT (OD043 10) 


40.1 


Spleen 


0.3 


84139 Melanoma Mets to 
Lung (OD04321) 


36.3 


Lymph node 


0.8 


84138 Lung NAT (OD04321) 


29.7 


Colorectal 


1.2 


Normal Kidney GENPAK 
061008 


62.4 


Stomach 


0.2 


83786 Kidney Ca, Nuclear 
grade2(OD04338) 


66.0 


Small intestine 


0.4 


83787 Kidney NAT 
(OD04338) 


23.3 
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Colon ca. 

SW480 i 


10.2 


83788 Kidney Ca Nuclear 
grade 1/2 (OD04339) 


34.9 


Colon ca*(SW480 
met)SW620 


59.0 


83789 Kidney NAT 
(OD04339) 


31.0 


Colon ca. 
HT29 


8.4 


83790 Kidney Ca, Clear cell 
type (OD04340) 


45.7 


Colon ca. ! 
HCT-116 


14.7 


83791 Kidney NAT 
(OD04340) 


33.0 


Colon ca. 
CaCo-2 


18.6 


83792 Kidney Ca, Nuclear 
grade3(OD04348) 


18.3 


83219 CC Well to Mod 
Diff(OD03866) 


2.2 


83793 Kidney NAT 
(OD04348) 


42.9 


Colon ca. 
HCC-2998 


16.6 


87474 Kidney Cancer 
(OD04622-01) 


14.5 


Gastric ca.* (liver met) 
NCI-N87 


12.3 


87475 Kidney NAT 
(OD04622-03) 


3.9 


Bladder 


5.9 


85973 Kidney Cancer 
(OD04450-01) 


70.2 


Trachea 


0.3 


85974 Kidney NAT 
(OD04450-03) 


31.6 


Kidney 


1.8 


Kidney Cancer Clontech 
8120607 


11.4 


Kidney (fetal) 


7.8 


Kidney NAT Clontech 
8120608 


7.1 


Renal ca. 
786-0 


8.5 


Kidney Cancer Clontech 
8120613 


8.2 


Renal ca. 
A498 


7.0 


Kidney NAT Clontech j 
8120614 


8.1 


Renal ca. 
RXF393 


17.6 


Kidney Cancer Clontech 
9010320 


18.3 


Renal ca. 
ACHN 


3.9 


Kidney NAT Clontech 
9010321 


33.4 


Renal ca. 
UO-31 


12.8 


Normal Uterus GENPAK 
061018 


11.8 


Renal ca. 
TX-10 


23.4 


Uterus Cancer GENPAK 
064011 


30.1 


Liver 


1.4 


Normal Thyroid Clontech A+ 
6570-1 


17.3 


Liver (fetal) 


1.0 


Thyroid Cancer GENPAK 
064010 


39.2 


Liver ca. (hepatoblast) 
HepG2 


9.3 


Thyroid Cancer 
ENVITROGEN A302152 


18.7 


Lung 


77.4 


Thyroid NAT EWITROGEN 
A302153 


18.7 


Lung (fetal) 


1.7 


Normal Breast GENPAK 
061019 


32.3 


Lung ca. (small cell) 
LX-1 


57.5 


84877 Breast Cancer 
(OD04566) 


40.6 
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Lung ca, (small cell) 
NCI-H69 


11.7 


85975 Breast Cancer 
(OD04590-01) 


100.0 


Lung ca. (s.cell var.) 
SHP-77 


100.0 


85976 Breast Cancer Mets 
(OD04590-03) 


76.3 


Lung ca. (large 
ceU)NCI-H460 


6.3 


87070 Breast Cancer 
Metastasis (OD04655-05) 


73.2 


Lung ca. (non-sm. cell) 
A549 


28.7 


GENPAK Breast Cancer 
064006 


46.0 


Lung ca. (non-s.cell) 
NCI-H23 


4.8 


Breast Cancer Res. Gen. 1024 


312 


Lung ca (non-s.cell) 
HOP-62 


3.1 


Breast Cancer Clontech 
9100266 


41.5 


Lung ca. (non-s.cl) 
NCI-H522 


31.7 


Breast NAT Clontech 
9100265 


13.4 


Lung ca. (squam.) 
SW 900 


19.4 


Breast Cancer INVITROGEN 
A209073 


39.2 


Lung ca. (squam.) 
NCI-H596 


27.4 


Breast NAT INVITROGEN 
A2090734 


39.8 


Mammary gland 


3.7 


Normal Liver GENPAK 
061009 


20.9 


Breast ca * (pi. 
efiusion) MCF-7 


10.6 


Liver Cancer GENPAK 
064003 


15.6 


Breast ca.* (pl.ef) 
MDA-MB-231 


6.6 


Liver Cancer Research 
Genetics RNA 1025 


17.2 


Breast ca * (pi. 
effusion) T47D 


17.8 


Liver Cancer Research 
Genetics RNA 1026 


9.6 


Breast ca. 
BT-549 


15.0 


Paired Liver Cancer Tissue 
Research Genetics RNA 
6004-T 


42.9 


Breast ca. 
MDA-N 


5.6 


Paired Liver Tissue Research 
Genetics RNA 6004-N 


18.6 


Ovary 


1.4 


Paired Liver Cancer Tissue 
Research Genetics RNA 
6005-T 


12.2 


Ovarian ca. 
OVCAR-3 


8.4 


Paired Liver Tissue Research 
Genetics RNA 6005-N 


8.1 


Ovarian ca. 
OVCAR-4 


2.6 


Normal Bladder GENPAK 
061001 


33.7 


Ovarian ca. 
OVCAR-5 


44.9 


Bladder Cancer Research 
Genetics RNA 1023 


5.7 


Ovarian ca. 
OVCAR-8 


5.0 


Bladder Cancer 
INVITROGEN A302173 


23.5 


Ovarian ca. 
IGROV-1 


5.2 


87071 Bladder Cancer 
(OD04718-01) 


69.7 


Ovarian ca.* (ascites) 
SK-OV-3 


50.1 


87072 Bladder Normal 
Adjacent (OD047 18-03) 


20.6 


Uterus 


0.7 


Normal Ovary Res. Gen. 


2.9 


Plancenta 


0.0 


Ovarian Cancer GENPAK 
064008 


18.6 
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Prostate 


0.4 


87492 Ovary Cancer 
(OD04768-07) 


53.2 


Prostate ca* (bone 
met)PC-3 


1.9 


87493 Ovary NAT 
(OD04768-08) 


13.5 


Testis 


0.3 


Normal Stomach GENPAK 
061017 


15.0 


Melanoma 
Hs688(A).T 


2.1 


Gastric Cancer Clontech 
9060358 


2.5 


Melanoma* (met) 
Hs688(B).T 


2.3 


NAT Stomach Clontech 
9060359 


8.3 


Melanoma 
UACC-62 


5.7 


Gastric Cancer Clontech 
9060395 


19.1 


Melanoma 
M14 


5.6 


NAT Stomach Clontech 
9060394 


4.6 


Melanoma 
LOXIMVI 


12.5 


Gastric Cancer Clontech 
9060397 


29.5 


Melanoma* (met) SK- 
MEL-5 


11.3 


NAT Stomach Clontech 
9060396 


3.7 


Adipose 


3.2 


Gastric Cancer GENPAK 
064005 


9.9 



Table 31. Panels 3D and 4D 



PANEL 3D 




PANEL 4D 




Tissue Name 


Rel. Expr., % 
3dx4tm5137f 
_ag673_bl 


Tissue Name 


Rel. Expr., 
% 

4dtm4833f 
ag673 


94905_Daoy_Medulloblasto 
ma/Cerebellum sscDNA 


13.6 


93768_Secondary Thl_anti- 
CD28/anti-CD3 


11.9 


94906_TE671_Medulloblast 
om/Cerebellum sscDNA 


11.6 


93769_Secondary Th2_anti- 
CD28/anti-CD3 


8.2 


94907_D283 

Med_Medulloblastoma/Cere 
bellum sscDNA 


53.3 


93770 Secondary Trl anti- 
CD28/anti-CD3 


6.6 


94908_PFSK-l_Primitive 
Neuroectodermal/Cerebellum 
sscDNA 


9.3 


93573_Secondary Thl_resting 
day 4-6 in IL-2 


0.0 


94909 XF- 

498 CNS sscDNA 


7.8 


93572_Secondaiy Th2_resting 
day 4-6 in IL-2 


0.8 


94910_SNB- 
78_CNS/glioma_sscDNA 


8.6 


93571_Secondary Trl_resting 
day 4-6 in IL-2 


0.5 


9491 1_SF- 

268 CNS/glioblastoma sscD 
NA 


14.1 


93568 j>rimary Thl_anti- 
CD28/anti-CD3 


17.7 


94912_T98G_Glioblastoma_ 
sscDNA 


15.3 


93569_primary Th2_anti- 
CD28/anti-CD3 


9.1 


96776_SK-N- 

SH Neuroblastoma 


15.1 


93570_primary Trl_anti- 
CD28/anti-CD3 


16.3 
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94913JSF- 

295_CNS/glioblastoma_sscD 
NA 


13.6 


93565_primary Thl_resting dy 
4-6 in IL-2 


7.9 


949 14_Cerebellum_sscDNA 


5.5 


93566_primary Th2_resting dy 
4-6 in IL-2 


2.6 


96777_Cerebellum_sscDNA 


0.9 


93567_primary Trl_resting dy 
4-6 in IL-2 


4.3 


94916_NCI- 

H292_Mucoepidermoid lung 
carcinoma sscDNA 


36.5 


93351_CD45RA CD4 
lymphocyte anti-CD28/anti- 
CD3 


13.4 


94917_DMS-1 14_Small cell 
lung cancer_sscDNA 


8.1 


93352_CD45RO CD4 
lymphocyte anti-CD28/anti- 
CD3 


8.9 


94918_DMS-79_Small cell 
lung 

cancer/neuroendocrine sscD 
NA 


100.0 


93251_CD8 

Lymphocytes anti-CD28/anti- 
CD3 


12.9 


94919_NCI-H146_Small cell 
lung 

cancer/neuroendocrine sscD 
NA 


32.1 


93353_chronic CD8 
Lymphocytes 2ry resting dy 4- 
6 in IL-2 


10.5 


94920_NCI-H526_Small cell 
lung 

cancer/neuroendocrine sscD 
NA 


30.0 


93574_chronic CD8 
Lymphocytes 2ry activated 
CD3/CD28 


5.2 


94921_NCI-N417_Small cell 
lung 

cancer/neuroendocrine sscD 
NA 


30.6 


93354_CD4_none 


1.4 


94923_NCI-H82_Small cell 
lung 

cancer/neuroendocrine sscD 
NA 


22.6 


93252 Secondary 
Thl/Th2/Trl_anti-CD95 CH11 


0.8 


94924_NCI- 

H157_Squamous cell lung 
cancer (metastasis)_sscDNA 


67.1 


93 103_LAK cells_resting 


18.0 


94925_NCI-H1 155_Large 
cell lung 

cancer/neuroendocrine sscD 
NA 


19.8 


93788_LAKceUs_IL-2 


7.8 


94926_NCI-H1299_Large 
cell lung 

cancer/neuroendocrine sscD 
NA 


36.3 


93787_LAK ceUs_EL-2+IL-12 


8.1 


94927_NCI-H727_Lung 
carcinoid sscDNA 


20.2 


93789_LAK ceUs_IL-2+IFN 


13.6 


94928 JNCI-UMC-1 l_Lung 
carcinoid sscDNA 


73.3 


93790_LAK cells_IL-2+ EL-18 


10.9 
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94929 LX-1 Small cell lung 
cancer_sscDNA 


202 


93104 LAK 

cells PMA/ionomycin and IL- 
18 


4.8 


94930_Colo-205_Colon 
cancer sscDNA 


11.6 


93578_NK Cells IL-2_restmg 


1.7 


94931_KM12_Colon 
cancer sscDNA 


31.6 


93 1 09_Mixed Lymphocyte 
Reaction Two Way MLR 


13.0 


94932 JCM20L2_Colon 
cancer sscDNA 


8.5 


931 10_Mixed Lymphocyte 
Reaction_Two Way MLR 


11.4 


94933_NCI-H716_Colon 
cancer sscDNA 


36.9 


93 1 1 l_Mixed Lymphocyte 
Reaction Two Way MLR 


3.2 


94935_SW^8_Colon 
adenocarcinoma sscDNA 


12.5 . 


931 ^Mononuclear Cells 
(PBMCs)_resting 


1.4 


94936_SW1116_Colon 
adenocarcinoma sscDNA 


9.0 


93113 Mononuclear Cells 
(PBMCs)_PWM 


23.7 


94937_LS 174T_Colon 
adenocarcinoma sscDNA 


32.6 


93114 Mononuclear Cells 
(PBMCs) PHA-L 


8.7 


94938_SW-948_Colon 
adenocarcinoma sscDNA 


1.5 


93249_Ramos (B cell)_none 


19.9 


94939_SW-480_Colon 
adenocarcinoma sscDNA 


6.7 


93250_Ramos (B j 
cell) ionomycin 


100.0 


94940 NCI-SNU-5 Gastric 
carcinoma sscDNA 


12.4 


93349 B lymphocytes PWM 


65.1 


94941JCATO ffl_Gastric 

carcinoma sscDNA 


55.9 


93350_B lymphoytes_CD40L 
and IL-4 


6.0 


94943_NCI-SNU-16_Gastric 
carcinoma sscDNA 


12.2 


92665_EOL-l 
fEosinoDhil) dbcAMP 
differentiated 


12.5 


94944 NCI-SNU-1 Gastric 
carcinomajsscDNA 


58.1 


93248 EOL-1 

(Eosinophil^dbcAMP/PMAio 
nomycin 


4.4 


94946_RF-l_Gastric 
adenocarcinoma sscDNA 


11.4 


93356JDendritic Cells_none 


9.5 ! 


94947 JlF^8_Gastric 
adenocarcinoma sscDNA 


15.3 


93355 J^ndritic CeUs_LPS 
100 ng/ml 


8.7 


96778_MKN-45_Gastric 
carcinoma sscDNA 


33.2 


93775 Dendritic Cells anti- 
CD40 


12.2 


94949__NCI-N87_Gastiic 
carcinoma sscDNA 


16.7 


93774_Monocytes_resting 


14.8 


9495 l_OVCAR-5J3varian 
carcinoma sscDNA 


10.0 


93776 JVlonocytes_LPS 50 
ng/ml 


15.5 


94952 RL95-2 Uterine 
carcinoma sscDNA 


7.6 


93581 Macrophages resting 


35.4 


94953_HelaS3_Cervical 
adenocarcinoma sscDNA 


11.1 


93582 JVlacrophages_LPS 100 
ng/ml 


A A 

4.4 


94954_Ca Ski_Cervical 
epidermoid carcinoma 
(metastasis) sscDNA 


28.9 


93098_HUVEC 
(Endothelial)_none 


27.0 


94955_ES-2_Ovarian clear 

: „ „„ 0 n\T a 


15.5 


93099_HUVEC 


39.2 
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cell carcinoma_sscDNA 




(Endomekal)_starved 




94957Jtamos/6h stimj'; 
Stimulated with 
PMA/ionomycin 6h sscDNA 


15.8 


9310O_HUVEC 
(Endothelial)JL-lb 


12.7 


94958_Ramos/14h stim_ M ; 
Stimulated with 
PMA/ionomycin 
14h sscDNA 


12.4 


93779 JfUVEC 
(Endothelial)_IFN gamma 


17.1 


94962_MEG-01J3ironic 
myelogenous leukemia 
(megokaryoblast)_sscDNA 


25.5 


93102 JIUVEC 
(Endomelial) JIOT alpha + 
IFN gamma 


14.0 


94963 JtajiJ3urkitt , s 
lymphoma_sscDNA 


8.4 


93101JIUVEC 
(Endothelial)_TNF alpha + IL4 


22.4 


94964 J^auoUJBuridtfs 
Iymphoma_sscDNA 


29.1 


93781_HUVEC 
(EndothelialLIL-11 


11.7 


94965_U266_B-cell 
plasmacytoma/myeloma ssc 
DNA 


8.0 


93583_Lung Microvascular 
Endothelial Cells jione 


14.7 


94968 j:A46_Burkitt's 
lymphoma_sscDNA 


12 


93584_Lung Microvascular 
Endothelial Cells_TNFa (4 
ng/ml) and ELlb (1 ng/ml) 


19.2 


94970_RL_non-Hodgkin , s B- 
cell lymphomasscDNA 


10.7 


92662_Microvascular Dermal 
endothelium_none 


26.8 


94972_JMl_j>re-B-cell 
lymphoma/leukemia sscDN 
A 


5.5 


92663_Microsvasular Dermal 
endomelium JTNFa (4 ng/ml) 
andlLlb (1 ng/ml) 


18.2 


94973 Jurkat^T cell 
leukemia_sscDNA 


15.5 


93773 JBronchial i 
epithelium TNFa(4ng/ml) 
andlLlb (1 ng/ml)** 


2.4 


94974JTF- 

l_Eiythroleukemia_sscDNA 


33.9 


93347_Small Airway 
Epithelium none 


8.7 


94975 JHUT78_T-cell 
lyrnphoma_sscDNA 


23.7 


93348_Small Airway 
Epithelium_TNFa (4 ng/ml) 
andILlb(l ng/ml) 


37.9 


94977_U937_Histiocytic 
lymphomasscDNA 


29.4 


92668_Coronery Artery 
SMC resting 


17.4 


94980JOJ- 

8 1 2_Myelogenous 

leukemia sscDNA 


27.2 


92669_Coronery Artery 
SMC TNFa (4 ng/ml) and 
ELlb (1 ng/ml) 


7.4 


94981_769-P_Clearcell 
renal carcinoma sscDNA 


17.3 


93 1 07_astrocytes_resting 


13.1 


94983_Caki-2_Clear cell 
renal carcinoma sscDNA 


19.9 


93108_astrocytes_TNFa (4 
ng/ml) and ILlb (1 ng/ml) 


9.3 


94984_SW 839_Clearcell 
renal carcinoma sscDNA 


15.4 


92666_KU-812 
(Basophil)_resting 


13.0 


94986 Jj401JWilms' 
tumor sscDNA 


19.5 


92667JCU-812 
(Basophil)_PMA/ionoycin 


26.8 


94987 - Hs766T - Pancreatic 
carcinoma (LN 


12.7 


93579JXD1106 
(Keratinocytes)none 


14.9 
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metastasis)_sscDNA 








94988_CAPAN-l_Pancreatic 
adenocarcinoma (liver 
metastasis) sscDNA 


11.1 


93580JXD1106 
(Keratmocytes)_TNFa and 
IFNg ** 


1.4 


94989_SU86.86JPancreatic 
carcinoma (liver 
metastasis)_sscDNA 


38.5 


93791_Liver Cirrhosis 


2.3 


94990_BxPC-3_Pancreatic 
adenocarcinoma sscDNA 


8.2 


93792_Lupus Kidney 


1.0 


9499 l_HPAC_Pancreatic 
adenocarcinoma sscDNA 


24.3 


93577_NCI-H292 


12.3 


94992 JVOAPaCa- 
2_Pancreatic 
carcinoma sscDNA 


5.6 


93358 J*CI-ffi92J0L-4 


20.0 


94993 J3FPAC-lJPancreatic 
ductal 

adenocarcinoma sscDNA 


32.2 


93360_NCI-H292JL-9 


31.4 


94994 j>ANC-lj>ancreatic 

epithelioid ductal 
carcinoma_sscDNA 


29.0 


93359_NCI-H292_IL-13 


13.7 


94996_T24_Bladder 
carcinma (transitional 
cell)_sscDNA 


13.0 


93357 J4CI-H292JFN gamma 


15.2 


94997_5637_Bladder 
carcinoma sscDNA 


18.4 


93777_HPAEC_- 


14.7 


94998 JIT- 1 197JBladder 
carcinoma sscDNA 


25.1 


93778JKPAECJL-1 
beta/TNA alpha 


12.7 


94999 JJM-UC-3JBladder 
carcinma (transitional 
cell)_sscDNA 


3.3 


93254_Normal Human Lung 
Fibroblast_none 


8.0 


95000_A204_Rhabdomyosar 
coma__sscDNA 


16.1 


93253_Normal Human Lung 
Fibroblast TNFa (4 ng/ml) and 
IL-lb (1 ng/ml) 


3.6 


95001_HT- 

1080 Fibrosarcoma sscDNA 


16.7 


93257_Normal Human Lung 
Fibroblast IL-4 


18.7 


95002 JV1G- 
63_Osteosarcoma 
(bone) sscDNA 


12.9 


93256_Nonnal Human Lung 
Fibroblast_IL-9 


13.5 


95003_SK-LMS- 

l_Leiomyosarcoma 

(vulva)_sscDNA 


34.7 


93255_Normal Human Lung 
Fibroblast JL-1 3 


10.2 


95004_SJRH30_Rhabdomyo 
sarcoma (met to bone 
marrow) sscDNA 


16.9 


93258_Nonnal Human Lung 
FibroblastJQFN gamma 


17.2 


95005_A43 l_Epidermoid 
carcinoma sscDNA 


9.4 


93106_DermaI Fibroblasts 
CCD 1070 resting 


39.0 


95007_WM266- 

4 Melanoma sscDNA 


6.9 


93361_Dennal Fibroblasts 
CCD1070JTNF alpha 4 ng/ml 


47.0 
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95010J3U 145J>rostate 
carcinoma (brain 
metastasis) sscDNA 


0.3 


93105 Dermal Fibroblasts 
CCD1070JL-1 beta 1 ng/ml 


15.7 


95012_MDA-MB- 
468J8reast 

adenocarcinoma sscDNA 


13.4 


93772_deimal fibroblastJFN 
gamma 


5.6 


95013_SCC-4_Squamous 
cell carcinoma of 
tonguesscDNA 


0.1 


93771_dermal fibroblastJDL-4 


11.7 


950 14_SCC-9__Squamous 
cell carcinoma of 
tongue sscDNA 


0.2 


93259JBD Colitis 1** 


0.1 


95015_SCC-15_Squamous 
cell carcinoma of 
tongue sscDNA 


0.8 


93260 JBD Colitis 2 


0.3 


95017_CAL 27_Squamous 
cell carcinoma of 
tongue sscDNA 


29.0 


93261 JBD Crohns 


0.5 






735010_Colon_nonnal 


3.1 






735019_Lun£_none 


7.1 






64028-l_Thymus_none 


9.4 






64030-l_Kidney_none 


3.0 



Panel J. 3D description: The gene MOL9a is expressed in a number of tissues, including the centred 
nervous system, lung, mammary gland and kidney. Moreover, its expression seems to be enhanced in 
tumor cell lines as compared to normal tissue in most cases with a good therapeutic window. 
5 Panel 2D description: Tissue distribution of this gene MOL9a in panel 2D confirms the results obtained 
in panel 1.3 D. There is enhanced expression of this gene MOL9a in tumor tissue as against the normal 
adjacent tissue, particularly in lung and kidney cancer, but also in cases of colon cancer, ovarian cancer, 
breast cancer, gastric cancer, bladder cancer, liver cancer and thyroid cancer. Some metastases, 
particularly tlie lung and those from melanoma express high levels ofMOISa. Corroborative information 
10 about the expression of this molecule is available in die form ofESTs, mosdy from endothelial cells, 

colon, ovarian tumors, pancreas and brain regions. Panel 3D demonstrates increased expression of this 
gene MOL9a in a variety of carcinomas, supporting the results of panels 1 3D and 2D thus denxonstrating 
utility for this protein as an antibody target Uierefore antibodies specific to this protein may be used as 
a therapeutic in the treatment of various types of cancer. 

15 

Panel 4D Description: This gene MOL9a is upregulated in endothelium, and epithelium 
regardless of stimulus. There is also high level expression of this protein in ionomycin-treated B 
cell line and mitogen (pokeweed mitogen, PWM) treated B cells. Consistent with this finding 
Peripheral blood mononuclear cells (PBMC) treated with PWM also demonstrate increased 
20 expression of this molecule. Further, induction of MOL9a is seen in activated T cells. In PBMC 
the T cell specific mitogen (Phytohemagglutinin, PHA) induces the expression of this transcript 
and in acute and chronically activated T cells the expression of mis transcript is increased as 
compared to untreated or resting T cells. 
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The MOL9a is induced in activated B and T lymphocytes and may thus have a potential 
role in inflammation by regulating lymphocyte trafficking, or activation, or increasing tissue 
destruction. This molecule may also serve as a marker for activated T or B cells. 

Impact of Therapeutic Targeting of MOL9a: Small molecule or antibody therapies to the 
5 molecule encoded by MOL9a may inhibit tissue damage due to T or B cell activation and the 
bioactive molecules produced by these cell types. These diseases would include asthma/allergy, 
colitis, Crohn's disease, lupus, and arthritis. Alternatively, protein therapeutics based on this 
molecule could serve as an adjuvant and help boost the effectiveness of vaccines or regulate 
immune status during organ transplant 1 95067 19_B_EXT may also serve as a marker for 
10 activated T cells and serve as a diagnostic tool in determining the extent of inflammation in 
autoimmune diseases such as asthma/allergy, colitis, Crohn's disease, lupus, and arthritis. 



MOL9b 

Expression of gene MOL9b was assessed using the primer-probe set Ag2458, described 
15 in Table 32. Results of the RTQ-PCR runs are shown in Tables 33 and 34. 



Table 32. Probe name: Ag2458 



Primers 


Sequences 


Tm 


Length 


Start 
Position 


SEQBD 
NO: 


Forward 


5'-GTTGGTGGTTCC AGGATTTT-3 ' 


58.7 


20 


58 


100 


.Probe 


TET-5'- 

TGATGTCTCCTCTTCAGGCAATGTCT- 
3'-TAMRA 


66.6 


26 


104 


101 


Reverse 


5-CTGCCAGCCACAGTATAGGA-3' 


58.9 


20 


130 


102 



20 Table 33. Panels 1.3D and 2D 



PANEL 1.3D 






t'ANEL 2D 




Tissue Name 


Rel. Expr., 
% 

l.3dtm4270t 
_ag2458 


Rel. Expr., 
% 

I.3dx4tm54 
07t ag2458 
a2 


Tissue Name 


Rel. Expr., 
% 

2dtm4271t 
ag2458 


Liver 

adenocarcinoma 


33.9 


41 .4 


Normal Colon GENPAK 
061003 


62.4 


Pancreas 


1.3 


0.8 


83219 CC Well to Mod Diff 
(OD03866) 


16.8 


Pancreatic ca. 


6.9 


44.3 


83220 CC NAT (ODQ3866) 


11.7 
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CAP AN 2 










Adrenal gland 


0.7 


1.1 


83221 CC Gr.2 rectosigmoid 
(OD03868) 


18.6 


Thyroid 


2.8 


3.3 


83222 CCNAT(OD03868) 


8.4 


Salivary gland 


0.6 


2.0 


83235 CCModDiff 
(ODO3920) 


34.2 


Pituitary gland 


1.6 


0.4 


83236 CC NAT (ODO3920) 


11.3 


Brain (fetal) 


9.2 


19.7 


83237 CC Gr.2 ascend colon 


74.7 


Brain (whole) 


5.4 


17.7 


83238 CC NAT (OD03921) 


9.6 


Brain (amygdala) 


6.7 


9.0 


83241 CC from Partial 
Hepatectomy (ODO4309) 


35.6 


Brain (cerebellum) 


2.7 


10.2 


83242 Liver NAT 
(ODO4309) 


32.3 


Brain 

(hippocampus) 


23.0 


13.1 


87472 Colon mets to lung 
(OD04451-01) 


29.7 


Brain (substantia 
nigra) 


1.6 


7.2 


87473 Lung NAT 
(OD04451-02) 


6.4 


Brain (thalamus) 


4.9 


16.6 


Normal Prostate Clontech 
A+ 6546-1 


8.8 


Cerebral Cortex 


26.2 


8.2 


84140 Prostate Cancer 
(OD04410) 


25.9 


Spinal cord 


2.4 


9.8 


84141 Prostate NAT 
(OD04410) 


27.9 


CNS ca. (glio/astro) 
U87-MG 


11.0 


19.7 


87073 Prostate Cancer 
(OD04720-01) 


15.8 


CNSca. 

(glio/astro) U-118- 
MG 


45.1 


82.4 


87074 Prostate NAT 
(OD04720-02) 


28.1 


CNS ca. (astro) 
SW1783 


21.9 


46.0 


Normal Lung GENPAK 
061010 


24.8 


CNS ca.* (neuro; 
met)SK-N-AS 


73.7 


40.2 


83239 Lung Met to Muscle 
(OD04286) 


100.0 


CNS ca. (astro) 
SF-539 


9.9 


12.0 


83240 Muscle NAT 
(OD04286) 


31.9 


CNS ca. (astro) 
SNB-75 


22.5 


71.5 


84136 Lung Malignant 
Cancer (OD03126) 


29.5 


CNS ca. (glio) 
SNB-19 


6.2 


18.1 


84137 Lung NAT 
(OD03126) 


27.9 


CNS ca. (glio) 
U251 


5.9 


45.0 


84871 Lung Cancer 
(OD04404) 


71.7 


CNS ca. (glio) 
SF-295 


13.0 


26.4 


84872 Lung NAT 
(OD04404) 


15.1 


Heart (fetal) 


1.7 


0.0 


84875 Lung Cancer 
(OD04565) 


28.5 


Heart 


1.2 


4.4 


84876 Lung NAT 
(OD04565) 


13.1 
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Fetal Skeletal 


11.4 


1.2 


85950 Lung Cancer 
(OD04237-01) 


90.1 


Skeletal muscle 


3.8 


41.7 


85970 Lung NAT 
(OD04237-02) 


16.8 


Bone marrow 


1.5 


2.2 


83255 Ocular Mel Met to 
Liver (ODO4310) 


76.3 


Thymus 


0.8 


0.4 


83256 Liver NAT 
(ODO4310) 


26.8 


Spleen 


0.9 


0.8 


841 39 Melanoma Mets to 
Lung(OD04321) 


36.3 


Lymph node 


1.3 


10.4 


84138 Lung NAT 
(OD04321) 


22.5 


Colorectal 


4.8 


3.1 


Normal Kidney GENPAK 
061008 


33.7 


Stomach 


0.0 


3.1 


83786 Kidney Ca, Nuclear 
grade 2 (OD04338) 


68.8 


Small intestine 


1.1 


2.3 


83787 Kidney NAT 
(OD04338) 


33.4 


Colon ca. 
SW480 


19.5 


18.9 


83788 Kidney Ca Nuclear 
grade 1/2 (OD04339) 


30.6 


Colon ca.*(SW480 
met)SW620 


29.9 


29.8 


83789 Kidney NAT 
(OD04339) 


27.2 


Colon ca. 
HT29 


17.7 


9.7 


83790 Kidney Ca, Clear cell 
typeiOD04340) 


49.3 


Colon ca. 
HCT-116 


22.1 


43.8 


83791 Kidney NAT 
(OD04340) 


32.1 


Colon ca. 
CaCo-2 


13.5 


18.4 


83792 Kidney Ca, Nuclear 
grade 3 (OD04348) 


16.0 


83219 CC Well to 

ModDiff 

(OD03866) 


10.6 


7.0 


83793 Kidney NAT 
(OD04348) 


35.4 


Colon ca. 
HCC-2998 


45.7 


21.1 


87474 Kidney Cancer 
(OD04622-01) 


13.2 


Gastric ca.* (liver 
met) NCI-N87 


38.4 


69.5 


87475 Kidney NAT 
(OD04622-03) 


4.0 


Bladder 


7.4 


13.6 


85973 Kidney Cancer 
(OD04450-01) 


48.6 


Trachea 


2.9 


3.0 


85974 Kidney NAT 
(OD04450-03) 


30.4 


Kidney 


1.3 


3.5 


Kidney Cancer Clontech 
8120607 


20.4 


Kidney (fetal) 


3.6 


3.9 


Kidney NAT Clontech 
8120608 


7.3 


Renal ca. 
786-0 


10.0 


19.0 


Kidney Cancer Clontech 
8120613 


14.4 


Renal ca. 
A498 


29.7 


28.6 


Kidney NAT Clontech 
8120614 


8.8 


Renal ca. 
RXF393 


5.6 


53.0 


Kidney Cancer Clontech 
9010320 


12.2 
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Renal ca. 
ACHN 


5.5 


13.3 


Kidney NAT Clontech 
9010321 


22.1 


Renal ca. 
UO-31 


21.6 


45.5 


Normal Uterus GENPAK 
061018 


8.3 


Renal ca. 
TK-10 


20.4 


27.0 


Uterus Cancer GENPAK 
064011 


15.3 


Liver 


2.9 


2.4 


Normal Thyroid Clontech 
A+ 6570-1 


15.1 


Liver (fetal) 


3.8 


5.0 


Thyroid Cancer GENPAK 
064010 


33.0 


Liver ca. 

(hepatoblast) 

HepG2 


17.3 


28.2 


Thyroid Cancer 
INVrTROGENA302152 


21.6 


Lung 


1.9 


2.5 


Thyroid NAT 
INVTTROGEN A3021 53 


14.4 


Lung (fetal) 


1.4 


5.6 


Normal Breast GENPAK 
061019 


33.2 


Lung ca. (small 
cell) LX-1 


11.8 


40.6 


84877 Breast Cancer 
(OD04566) 


44.8 


Lung ca. (small 
cell) NCI-H69 


31.4 


44.0 


85975 Breast Cancer 
(OD04590-01) 


95.9 


Lung ca. (s.cell 
var.) SHP-77 


69.3 


90.5 


85976 Breast Cancer Mets 
(OD04590-03) 


61.1 


Lung ca. (large 
cell)NCI-H460 


9.9 


63.6 


87070 Breast Cancer 
Metastasis (OD04655-05) 


38.4 


Lung ca. (non-sm. 
cell)A549 


20.9 


25.9 


GENPAK Breast Cancer 
064006 


33.2 


Lung ca. (non- 
s.cell) NCI-H23 


11.3 


10.7 


Breast Cancer Res. Gen. 
1024 


23.0 


Lung ca (non-s.cell) 
HOP-62 


4.1 


9.1 


Breast Cancer Clontech 
9100266 


33.4 


Lung ca. (non-s.cl) 
NCI-H522 


29.1 


30.0 


Breast NAT Clontech 
9100265 


19.5 


Lung ca. (squam.) 
SW900 


9.7 


20.1 


Breast Cancer 
INVITROGEN A209073 


47.0 


Lung ca. (squam.) 
NCI-H596 


9.5 


43.2 


Breast NAT INVITROGEN 
A2090734 


37.6 


Mammary gland 


5.3 


9.0 


Normal Liver GENPAK 
061009 


15.5 


Breast ca.* (pi. 
effusion) MCF-7 


14.3 


30.8 


Liver Cancer GENPAK 
064003 


14.0 


Breast ca.* (pl.ef) 
MDA-MB-231 


42.3 


41.1 


Liver Cancer Research 
Genetics RNA 1025 


20.4 


Breast ca.* (pi. 
effusion) T47D 


9.6 


13.8 


Liver Cancer Research 
Genetics RNA 1026 


7.1 


Breast ca. 
BT-549 


100.0 


100.0 


Paired Liver Cancer Tissue 
Research Genetics RNA 
6004-T 


24.8 


Breast ca. 


17.2 


8.8 


Paired Liver Tissue Research 

n — t>xi a cc\(\a xt 


18.0 
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(Vnpfirc KNA 6004-N 




Ovary 


4.8 


0.9 


Paired Liver Cancer Tissue 
Research Genetics RNA 
6005-T 


13.4 


Ovarian ca. 
OVCAR-3 


11.3 


20.9 


Paired Liver Tissue Research 
Genetics RNA 6005-N 


7.3 


Ovarian ca: 
OVCAR-4 


2.5 


10.6 


Normal Bladder GENPAK 
061001 


59.9 


Ovarian ca. 
OVCAR-5 


20.0 


26.3 


Bladder Cancer Research 
Genetics RNA 1023 


5.2 


Ovarian ca. 
OVCAR-8 


18.6 


16.5 


Bladder Cancer 
INVTTROGEN A302 1 73 


30.1 


Ovarian ca. 
IGROV-1 


6.9 


6.8 


87071 Bladder Cancer 
COD047 18-01) 


65.5 


Ovarian ca.* 
/ascites) SK-OV-3 


32.3 


64.3 


87072 Bladder Normal 
Adiacent (OD04718-03) 


18.4 


Uterus 


1.8 


4.3 


Normal Ovary Res. Gen 


5.1 


Plancenta 


2.2 


1.1 


Ovarian Cancer GENPAK 
064008 


29.1 


Prostate 


2.0 


2.4 


87492 Ovary Cancer 
(OD04768-07) 


66.0 


Prostate ca.* (bone 
met)PC-3 


3.1 


3.2 


87493 Ovary NAT 
(OD04768-08) 


7.7 


Testis ' 


2.0 


1.5 


Normal Stomach GENPAK 
061017 


16.4 


Melanoma 
Hs688(A).T 


4.6 


5.4 


Gastric Cancer Clontech 
9060358 


3.5 


Melanoma* (met) 
Hs688(B).T 


2.4 


6.9 


NAT Stomach Clontech 
9060359 


10.5 


Melanoma 
UACC-62 


1.3 


12.6 


Gastric Cancer Clontech 
9060395 


29.9 


Melanoma 
M14 


5.9 


56.2 


NAT Stomach Clontech 
9060394 


12.9 


Melanoma 
LOXIMVI 


14.7 


8.1 


Gastric Cancer Clontech 
9060397 


42.3 


Melanoma* (met) 
SK-MEL-5 


16.2 


24.7 


NAT Stomach Clontech 
9060396 


6.2 


Adipose 


4.4 


3.4 


Gastric Cancer GENPAK 
064005 


31.0 



Table 34. Panels 3D and 4D 



PANEL 3D 




PANEL 4D 




Tissue Name 


Rel. Expr., 
% 

3dx4tm5121 
t ag2458 b 


Tissue Name 


Rel. Expr., 
% 

4dtm4272t 
ag2458 
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2 






94905_Daoy__Medulloblast 
oma/Cerebellum sscDNA 


13.6 


93768 Secondary Thl_anti- 
CD28/anti-CD3 


14.5 


94906JTE67 1 JVledullobla 
stom/Cerebellum sscDNA 


7.4 


93769_Secondary Th2_anti- 
CD28/anti-CD3 


7.4 


94907_D283 

Med_Medulloblastoma/Cer 
ebellum sscDNA 


53.0 


93770_Secondary Trl_anti- 
CD28/anti-CD3 


9.5 


94908 J>FSK4J>rimitive 
Neuroectodeimal/Cerebellu 
m sscDNA 


6.5 


93573_Secondary Thl_resting day 
4-6 in DL-2 


0.2 


94909 XF- 

498 CNS sscDNA 


6.8 


93572 Secondary Th2 resting day 
4-6inIL-2 


0.6 


94910JSNB- 

78 CNS/glioma_sscDNA 


9.8 


93571 Secondary Trl resting day 
4-6inIL-2 


0.8 


9491 1J5F- 

268 CNS/glioblastoma ssc 
DNA 


10.2 


93568_primary Thl_anti- 
CD28/anti-CD3 


19.8 


94912JT98GJ31ioblastom 
a sscDNA 


15.7 


93569_primary Th2_anti- 
CD28/anti-CD3 


13.0 


96776 JSK-N- 

SH_Neuroblastoma 

(metastasis)_sscDNA 


16.5 


93570_primary Trl_anti- 
CD28/anti-CD3 


19.2 


94913JSF- 

295 CNS/glioblastoma ssc 
DNA 


7.4 


93565_primary Thlresting dy 4-6 
inIL-2 


8.8 


94914 Cerebellum sscDN 
A 


3.9 


93566_primary Th2_resting dy 4-6 
inJDL-2 


2.2 


96777 Cerebellum sscDN 
A 


0.8 


93567_primary Trl_resting dy 4-6 
inIL-2 


3.6 


94916_NCI- 
H292_Mucoepidermoid 
lung carcinomasscDNA 


46.9 


93351_CD45RACD4 
lymphocyte_anti-CD28/anti-CD3 


19.5 


94917 J}MS-114_Small 
cell lung cancer_sscDNA 


10.9 


93352_CD45RO CD4 
lymphocyte_anti-CD28/anti-CD3 


17.7 ■ 


94918JDMS-79_Small cell 
lung 

cancei/neuroendocrine ssc 
DNA 


100.0 


93251_CD8 Lymphocytes_anti- 
CD28/anti-CD3 


9.3 


94919_NCI-H146_Small 
cell lung 

cancer/neuroendocrine ssc 
DNA 


33.4 


93353_chronic CD8 Lymphocytes 
2ry_resting dy 4-6 in IL-2 


11.0 


94920_NCI-H526_Small 
cell lung 

cancer/neuroendocrine ssc 
DNA 


30.2 


93574_cbronic CD8 Lymphocytes 
2ry_activated CD3/CD28 


5.5 


94921_NCI-N417_Small 
cell lung 


26.1 


93354_CD4_none 


0.7 
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cancer/neuroendocrine_ssc 
DNA 








94923_NCI-H82_Small 
cell lung 

cancei/neuroendocrine__ssc 
DNA 


28.4 


93252 Secondary 
Thl/Th2/Trl_anti-CD95 CH11 


0.9 


94924 JNCI- 

H157_Squamous cell lung 
cancer 

(metastasis)_sscDNA 


88.0 


93103 JLAK cells^resting 


15.8 


94925_NCI-H1 1 55_Large 
cell lung 

cancei/neuroendocrine ssc 
DNA 


31.3 


^3788_LAKcells_IL-2 


6.1 


94926 JO-H1299_Large 
cell lung 

cancer/neuroendocrine ssc 
DNA 


32.4 


93787_LAK cellsJL-2+IL-12 


8.8 


94927 J*CI-H727_Lung 
carcinoid_sscDNA 


23.0 


93789_LAK cells JL-2+IFN 
gamma 


11.7 


94928 JTCI-UMC- 
1 l_Lung 

carcinoid sscDNA 


75.0 


93790JLAK cells_IL-2+ IL-1 8 


13.2 


94929JLX-lJSmall cell 
lung cancersscDNA 


17.0 


93104JAK 

cells_PMA/ionomycin and IL-1 8 


8.5 


94930_Colo-205_Colon 
cancer sscDNA 


11.1 


93578_NK Cells rL~2_resting 


2.5 


94931_KM12_Colon 
cancer sscDNA 


37.3 


93109_Mixed Lymphocyte 
ReactionTwo Way MLR 


17.0 


94932_KM20L2j3olon 
cancer sscDNA 


7.8 


93 1 1 0_Mixed Lymphocyte 
ReactionTwo Way MLR 


10.2 


94933_NCI-H716_Colon 
cancer sscDNA 


32.3 


93 1 1 l__Mixed Lymphocyte 
ReactionTwo Way MLR 


7.4 


94935__SW-48_Colon 
adenocarcinoma sscDNA 


7.8 


93 1 12_Mononuclear Cells 
(PBMCs)jresting 


2.4 


94936JSW1116__Colon 
adenocarcinoma sscDNA 


10.9 


93113 Mononuclear Cells 
(PBMCs)_PWM 


23.7 


94937JJS 174T_Colon 
adenocarcinoma sscDNA 


29.3 


93114 Mononuclear Cells 
(PBMCs)_PHA-L 


9.6 


94938_SW-948_Colon 
adenocarcinoma sscDNA 


1.9 


93249_Ramos (B cell)_none 


30.4 


94939JSW-480_Colon 
adenocarcinoma sscDNA 


4.9 


93250_Ramos (B ceU)_kmomycin 


100.0 


94940 NCI-SNU- 
5_Gastric 

carcinoma sscDNA 


8.3 


93349 B lymphocytes PWM 


70.2 


94941 JKATO m_Gastric 
carcinoma sscDNA 


53.1 


93350 Blymphoytes CD40L and 
IL-4 


5.5 


94943_NCI-SNU- 


7.3 


92665_EOL-l 


11.7 
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16_Gastric 
carcinoma sscDNA 




(Eosinophil)_dbcAMP 
differentiated 




94944_NCI-SNU- 
l_Gastric 

carcinoma sscDNA 


64.4 


93248JEOL-1 

(Eosmoplnl)_dbcAMP/PMAionom 
ycin 


6.3 


94946_RF-l_Gastric 
adenocarcinoma sscDNA 


11.4 


93356 JDendritic Cells_none 


12.4 


94947 JtF-48_Gastric 
adenocarcinoma sscDNA 


15.4 


93355 JDendritic Cells_LPS 100 
ng/ml 


9.0 


96778JtfKN-45_Gastric 
carcinoma sscDNA 


28.8 


93775_Dendritic Cells_anti-CD40 


12.5 


94949 J*a-N87j3astric 
carcinoma sscDNA 


19.5 


93774_Monocytes_resting 


15.2 


94951_OVCAR-5_Ovarian 
carcinoma sscDNA 


11.7 


93776_Monocytes_LPS 50 ng/ml 


11.7 


94952 JtL95-2JJterine 
carcinoma sscDNA 


4.5 


935 8 l_Macrophages jresting 


41.8 


94953_HelaS3_Cervical 
adenocarcinoma sscDNA 


11.3 


93582_Macrophages_LPS 100 
ng/ml 


6.8 


94954_Ca Ski_Cervical 
epidermoid carcinoma 
(metastasis)sscDNA 


24.3 


93098_HUVEC 
(Endothelial)_none 


35.8 


94955 JES-2_Ovarian clear 
cell carcinoma sscDNA 


16.1 


93099_HUVEC 
(Endothelial)_starved 


58.2 


94957_Ramos/6h stimj'; 
Stimulated with 
PMA/ionomycin 
6h sscDNA 


8.7 


93100 HUVEC (Endothelial) IL- 
lb ! 


16.5 


94958_Ramos/14h stimj'; 
Stimulated with 
PMA/ionomycin 
14h sscDNA 


8.3 


93779 JIUVEC (EndotheliaLLIFN 
gamma 


26.1 


94962_MEG-01_Chronic 
myelogenous leukemia 
(megokaryoblast)_sscDNA 


22.1 


93102_HUVEC 
(Endothelial)_TNF alpha + IFN 
gamma 


16.0 


94963 Jtej^Burkitt's 
lymphoma sscDNA 


8.3 


93101_HUVEC 
(Endothelial) TNF alpha + IL4 


23.2 


94964_Daudi_Burkitfs 
lymphomasscDNA 


19.8 


93781 HUVEC (Endothelial) IL- 
11 


13.6 


94965 JJ266_B-cell 
plasmacytoma/myeloma ss 
cDNA 


6.4 


93583_Lung Microvascular 
Endothelial Cells_none 


21.0 


94968j:A46_Burkitt , s 
lymphoma_sscDNA 


6.5 


93584_Lung Microvascular 
Endothelial Cells_TNFa (4 ng/ml) 
and ILlb (1 ng/ml) 


18.3 


94970 JILjion-Hodgkin's 
B-ceU lymphoma_ sscDNA 


9.0 


92662_Microvascular Dermal 
endothelium none 


35.4 


94972_JMl_pre-B-cell 
lymphoma/leukemia_sscD 


4.1 


92663_Microsvasular Dermal 
endomelium_TNFa (4 ng/ml) and 


20.0 
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NA 




ILlb (1 ng/ml) 




94973 JurkatJT cell 
leukemia_sscDNA 


9.8 


93773JBronchial 

epithelium TNFa (4 ng/ml) and 

ILlb(l ng/ml) ** 


14.9 


94974 JTF- 

1 Erythroleukemia sscDN 
A 


31.5 


93347_Small Airway 
Epithelium_none 


7.2 


94975 JfUT 78 JT-cell 
lymphoma_sscDNA 


15.7 


93348_SmalI Airway 
Epithelium TNFa (4 ng/ml) and 
ILlb (1 ng/ml) 


45.4 


94977_U937_Histiocytic 
lymphomasscDNA 


30.5 


92668_Coronery Artery 
SMCj-esting 


17.7 


94980JCU- 

8 12_Myelogenous 

leukemia sscDNA 


21.7 


92669 Coronery Artery 
SMCTNFa (4 ng/ml) and ILlb (1 
ng/ml) 


10.8 


9498 lJ769-P_Clear cell 
renal carcinoma sscDNA 


16.6 


93 1 07_astrocytes_resting 


11.1 


94983_Caki-2_Clear cell 
renal carcinoma sscDNA 


16.8 


93108_astrocytes_TNFa (4 ng/ml) 
and ILlb (1 ng/ml) 


10.5 


94984_SW 839_Clear cell 
renal carcinoma sscDNA 


11.8 


92666_KU-812 (Basopml)_restmg 


18.8 


94986_G401_Wilms' 
tumor sscDNA 


16.5 


92667_KU-812 
(Basophil) PMA/ionoycin 


27.9 | 


94987 JKs766T_Pancreatic 
carcinoma (LN 
metastasis)_sscDNA 


12.9 


93579_CCD1106 
(Keratinocytes)_none 


20.9 


94988_CAPAN- 
l_Pancreatic 
adenocarcinoma (liver 
metastasis)_sscDNA 


12.0 


93580JXD1106 

(Keratinocytes)_TNFa and IFNg 
** 


7.4 


94989 - SU86.86_Pancreati 
c carcinoma (liver 
metastasis)_sscDNA 


32.6 


93791 JLiver Cirrhosis 


3.7 


94990_BxPC-3_Pancreatic 
adenocarcinoma sscDNA 


4.0 


93792_Lupus Kidney 


1.9 


9499 1 JEiPAC_Pancreatic 
adenocarcinoma sscDNA 


26.4 


93577_NCI-H292 


18.6 


94992 JVHAPaCa- 
2JPancreatic 
carcinoma sscDNA 


8.2 


93358_NCI-H292_IL-4 


33.7 


94993_CFPAC- 
1 JPancreatic ductal 
adenocarcinoma sscDNA 


30,3 


93360_NCI-H292JL-9 


36.9 


94994JPANC-l_Pancreatic 
epithelioid ductal 
carcinoma sscDNA 


26.7 


93359JNCI-H292JL-13 


20.0 


94996_T24_Bladder 
carcinma (transitional 
cell) sscDNA 


8.2 


93357 J*a-H292JFN gamma 


20.4 
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94997_5637JBladder 
carcinoma sscDNA 


20.5 


93777_HPAEC_- 


18.8 


94998_HT-1 197_Bladder 
carcinoma sscDNA 


18.0 


93778_HPAEC_EL-1 beta/TNA 
alpha 


18.9 


94999JJM-UC-3J31adder 
carcinma (transitional 
cell)_sscDNA 


4.6 


93254_Nonnal Human Lung 
Fibroblast_none 


9.5 


95000_A204_Rhabdomyos 
arcoma_sscDNA 


20.0 


93253_Normal Human Lung 
Fibroblast TNFa (4 ng/ml) and IL- 
lb (1 ng/ml) 


3.7 


9500 1JHT- 

1080 Fibrosarcoma sscDN 
A 


15.4 


93257_Nonnal Human Lung 
FibroblastJL-4 


24.7 


95002_MG- 

63_Osteosarcoma 

(bone)_sscDNA 


16.2 


93256_Normal Human Lung 
Fibroblast_IL-9 


19.2 


95003_SK-LMS- 

l_Leiomyosarcoma 

(vulva)_sscDNA 


27.5 


93255_Normal Human Lung 
FibroblastJL-13 


14.3 


95004_SJRH30JRhabdom 
yosarcoma (met to bone 
marrow)_sscDNA 


12.5 


93258_Normal Human Lung 
FibroblastJFN gamma 


23.2 


95005_A43 ^Epidermoid 
carcinoma sscDNA 


6.9 


93106_Dermal Fibroblasts 
CCD1070_resting 


47.0 


95007 JVM266- 

4 Melanoma sscDNA 


7.5 


93361_Dermal Fibroblasts 
CCD1070_TNF alpha 4 ng/ml 


42.3 


95010_DU 145_Prostate 
carcinoma (brain 
metastasis)_sscDNA 


0.1 


93105_Dermal Fibroblasts 
CCD1070JL-1 beta 1 ng/ml 


20.2 


95012JMDA-MB- 
468_Breast 

adenocarcinoma sscDNA 


13.5 


93772_dermal fibroblastJFN 
gamma 


9.2 


95013_SCC-4_Squamous 
cell carcinoma of 
tonguesscDNA 


1.3 


93771_dermal fibroblast JL-4 


22.1 


95014_SCC-9_Squamous 
cell carcinoma of 
tongue_sscDNA 


0.3 


93259JBD Colitis 1** 


1.4 


950 1 5_SCC- 1 5_Squamous 
cell carcinoma of 
tongue sscDNA 


0.3 


93260 JBD Colitis 2 


1.1 


95017_CAL 27JSquamous 
cell carcinoma of 
tongue_sscDNA 


24.5 


93261JBD Crohns 


1.0 






735010J^olonnormal 


4.1 






735019_Lung L none 


8.9 






64028-lJThymus_none 


10.5 
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64030-l_Kidney_none 


3.6 



Panel L3D description: The gene MOL9b is expressed in a number of tissues, including the central 
nervous system, lung, mammary gland and kidney. Moreover, its expression seems to be enhanced in 
tumor cell lines relative to normal tissue in most cases with a good therapeutic window, 
5 Panel 2D description: Tissue distribution of the gene MOL9b in panel 2D confirms the results obtained in 
panel L3 D. There is enhanced expression of this gene (MOL9b) in tumor tissue relative to normal 
adjacent tissue, particularly in lung and kidney cancers, but also in cancers of colon, ovary, breast, 
gasiric, bladder, liver and thyroid Some metastases, particularly lung metastases and those from 
melanoma express this geneMOL9b at particularly high levels. Corroborative information about the 
10 expression of this molecule is available in the form of ESTs, mostly from endothelial cells, colon, ovarian 
tumors, pancreas and brain regions. Panel 3D for the gene MOL9b shows high level of expression in a 
variety of carcinomas, supporting results from panels L3D and 2D, and demonstrating utility for this 
protein as an antibody target. Therefore antibodies specific to this protein may be used as a therapeutic 
in the treatment of various types of cancer. 

15 

Panel 4D Description: The gene MOL9b is upregulated in endothelium, and epithelium 
regardless of stimulus. There is up regulation of this molecule in an ionomycin treated B cell 
line and it is highly expressed in mitogen (PWM) treated B cells. Consistent with this finding, 
PBMCs treated with PWM also up regulate this molecule. Induction of this gene MOL9b is also 
20 seen in activated T cells. In PBMC the T cell specific mitogen PHA induces the expression of 
this transcript and in acute and chronically activated T cells the expression of this transcript is 
increased as compared to untreated or resting T cells. This transcript is also expressed in resting 
macrophages. 

Potential Role(s) of in Inflammation: The molecule MOL9b is induced by B and T 
25 lymphocytes and may potentiate inflammation by regulating lymphocyte trafficking, or 
activation, or increasing tissue destruction. This molecule may also serve as a marker for 
activated T or B cells and may also be involved in the differentiation of monocytes into 
macrophages (see reference). Macrophages also participate in inflamrnation by producing 
multiple biologically active proteins like cytokines, activating other cells within the local 
30 microenvironment, and ingesting dead and dying cells. 

Impact of Therapeutic Targeting of MOL9b: Small molecule or antibody therapies to the 
molecule encoded for by MOL9b may reduce or eliminate inflammation and tissue damage due 
to T or B cell activation or macrophages and the bioactive molecules produced by these cell 
types. These diseases would include asthma/allergy, colitis, Crohn's disease, lupus, and arthritis. 
35 Alternatively, protein therapeutics based on this molecule could serve as an adjuvant and help 
boost the effectiveness of vaccines or regulate immune status during organ transplant MOL9b 
also serves as a marker for activated T and B cells and serve as a diagnostic tool for deterniining 
indirectly measuring the extent of inflammation due to autoimmune diseases which induce T or 
B cells activation such as asthma/allergy, colitis, Crohn's disease, lupus, and arthritis. Elevated 
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expression of the gene MOL9b in macrophages may help in distinguishing resting macrophages 
from monocytes , dendritic cells. 

MOLlOa 

Expression of gene MOLlOa was assessed using the primer-probe set Agl 129, described 
in Table 35. Results of the RTQ-PCR runs are shown in Table 36. 



Table 35. Probe name: Agll29 



Primers 


Sequences 


Tm 


Lengt 
h 


Start 
Position 


SEQID 
NO: 


Forward 


5'-CTAGACCAGCAGCTGGATGAT-3' 


59.5 


21 


1518 


103 


Probe 


TET-5'- 

CTACAGACCAAGTTTGCTCGCCTCCT- 
3'-TAMRA 


68.7 


26 


1539 


104 


Reverse 


5'-CAATGCGGTAAGCAATCTTAAG-3' 


59 


22 


1587 


105 



Table 36. Panels 1.3D and 4D 



PANEL 1 JD 




PANEL 4D 




Tissue Name 


Rel. Expr., 
% 

1.3dx4tm57 
71t agl 129 
a2 


Tissue Name 


Rel. Expr., 
% 

4Dtml984t 
_agll29 


Liver 

adenocarcinoma 


3.9 


93768 Secondary Thl anti-CD28/anti- 
CD3 


0.0 


Pancreas 


3.7 


93769 Secondary Th2 anti-CD28/anti- 
CD3 


9.0 


Pancreatic ca. 
CAP AN 2 


2.8 


93770 Secondary Trl anti-CD28/anti- 
CD3 


9.0 


Adrenal gland 


1.8 


93573_Secondary Thl_resting day 4-6 
inIL-2 


0.0 


Thyroid 


3.2 


93572_Secondary Th2_resting day 4-6 
inIL-2 


0.0 | 


Salivary gland 


3.8 


93571 SecondaiyTrl resting day 4-6 in 
EL-2 


0.0 


Pituitary gland 


2.6 


93568_primary Thl anti-CD28/anti- 
CD3 


0.0 


Brain (fetal) 


21.9 


93569_primaryTh2 anti-CD28/anti- 
CD3 


0.0 


Brain (whole) 


14.6 


93570 jprimary Trl anti-CD28/anti- 
CD3 


8.9 ' 
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Brain (amygdala) 


13.0 


93565_primary Thl resting dy 4-6 in 
IL-2 


6.8 


Brain (cerebellum) 


1.8 


93566_primaryTh2 resting dy 4-6 in 
BL-2 


14.6 


Brain (hippocampus) 


13.7 


93567_jprimaryTrl resting dy 4-6 in IL- 
2 


0.0 


Brain (substantia 
nigra) 


21.4 


93351__CD45RA CD4 lymphocyte_anti- 
CD28/anti-CD3 


5.9 


Brain (thalamus) 


10.3 


93352_CD45RO CD4 lympnocyte_anti- 
CD28/anti-CD3 


14.3 


Cerebral Cortex 


10.5 


93251_CD8 Lymphocytes_anti- 
CD28/anti-CD3 


33.0 


Spinal cord 


8.1 


93353_chronic CD8 Lymphocytes 
2ry resting dy 4-6 in EL-2 


4.2 


CNS ca. (glio/astro) 
U87-MG 


6.0 


93574_chronic CD8 Lymphocytes 
2ry activated CD3/CD28 


0.0 


CNSca. (glio/astro) 
U-118-MG 


0.0 


93354_CD4_none 


1.4 


CNS ca. (astro) 
SW1783 


3.5 


93252 Secondary Thl/Th2/Trl_anti- 
CD95 CH11 


20.2 


CNS ca.* (neuro; met 
) SK-N-AS 


0.0 


93 103_LAK ceilsjresting 


8.7 


CNS ca. (astro) 
SF-539 


1.7 


93788 JLAK ceilsJL-2 


30.8 


CNS ca. (astro) 
SNB-75 


0.0 


93787 LAK cells_IL-2+IL-12 


15.6 


CNS ca. (glio) 
SNB-19 


0.9 


93789_LAK cells_EL-2+IFN gamma 


53.2 


CNS ca. (glio) 
U251 


1.4 


93790_LAK cells_IL-2+ IL-18 


20.6 


CNS ca. (glio) 
SF-295 


0.0 


93104 LAK cells PMA/ionomycin and 
IL-18 


0.0 


Heart (fetal) 


0.0 


93578_NK Cells IL-2jresting 


5.9 


Heart 


3.3 


93 109_Mixed Lymphocyte 
Reaction Two Way MLR 


12.9 


Fetal Skeletal 


2.0 


93 1 lOJVtixed Lymphocyte 
Reaction Two Way MLR 


3.5 


Skeletal muscle 


2.3 


93 1 1 ^Mixed Lymphocyte 
Reaction Two Way MLR 


6.8 


Bone marrow 


4.0 


931 12_Mononuclear Cells 
(PBMCs)_resting 


17.2 


Thymus 


3.9 


931 13_Mononuclear Cells 
fPRMCsi PWM 


32.5 


Spleen 


0.0 


93114 Mononuclear Cells 
(PBMCs)_PHA-L 


20.3 


Lymph node 


1.7 


93249_Ramos (B cell)_none 


22.2 


Colorectal 


1.4 


93250_Ramos (B cell)_ionomycin 


68.3 
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Stomach 


0.0 


93349_B lymphocytes J>WM 


33.7 


Small intestine 


0.0 


93350_B lymphoytes_CD40L and 3L-4 


9.4 


Colon ca. 
SW480 


1.5 


92665_EOL-l (Eosmophil)_dbcAMP 
differentiated 


15.3 


Colon ca.*(SW480 
met)SW620 


2.0 


93248_EOL-l 

(Eosinophil) dbcAMP/PMAionomycin 


0.0 


Colon ca. 
HT29 


2.2 


93356_Dendritic Cells_none 


0.0 


Colon ca. 
HCT-116 


3.7 


93355 JDendritic Cells_LPS 100 ng/ml 


6.7 


Colon ca. 
CaCo-2 


0.6 


93775_Dendritic Cells_anti-CD40 


0.0 


83219 CC Well to 
ModDiff(OD03866) 


5.2 


93774_Monocytes_resting 


20.6 


Colon ca. 
HCC-2998 


5.1 


93776 JVIonocytesJLPS 50 ng/ml 


15.8 


Gastric ca * (liver 
met)NCI-N87 


10.0 


935 8 ^Macrophages jresting 


15.9 


DlauxKx 


*T.U 


Marrnnhaires T PS 1 00 tiff/ml 


64.2 


Trachea 


30.4 


93098_HUVEC (EndotheUal)_none 


3.0 


Kidney 


3.3 


93099 JIUVEC (Endothelial)_starved 


0.0 


Kidney (fetal) 


7.3 


93100_HUVEC (Endomehal)JL-lb 


3.5 


Renal ca. 
786-0 


0.0 


93779JIUVEC (Endothelial)__IFN 
?amma 


6.7 


Renal ca. 
A498 


0.0 


93102_HUVEC (Endothelial^TNF 
alpha + 1FN gamma 


0.0 


Renal ca. 
RXF393 


4.4 


93101 JIUVEC (Endothelial)_TNF 
alpha + IL4 


8.1 


Renal ca. 
ACHN 


0.0 


93781 JIUVEC (Endothelial) JL-1 1 


0.0 


Renal ca. 
UO-31 


0.0 


93583 J^ung Microvascular Endothelial 
Cells none 


24.0 


Renal ca 
TK-10 


5.9 


93584_Lung Microvascular Endothelial 
Cells JTSIFa (4 ng/ml) and ILlb (1 
ng/ml) 


17.8 


Liver 


3.4 


92662_Microvascular Dermal 
endothelium none 


5.2 


Liver (fetal) 


0.0 


92663 Jvlicrosvasular dermal 
endothelium TNFa (4 ng/ml) and ILlb 
(1 ng/ml) 


0.0 


Liver ca. 

(hepatoblast) HepG2 


0.0 


93773 Bronchial epithelium TNFa (4 
ng/ml) and ILlb (1 ng/ml) ** 


79.6 


Lung 


30.2 


93347JSmall Airway Epithelium_none 


0.0 


Lung (fetal) 


100.0 


93348 JSmall Airway Epithelium_TNFa 
(4 ng/ml) and ILlb (1 ng/ml) 


53.6 
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Lung ca. (small cell) 
LX-1 


1.9 


92668_Coronery Artery SMC_resting 


0.0 


Lung ca. (small cell) 
NCI-H69 


9.8 


92669_Coronery Artery SMCTNFa (4 
ng/ml) and ILlb (1 ng/ml) 


0.0 


Lung ca. (s.cell var.) 
SHP-77 


9.5 


93 107_astrocytes_resting 


0.0 


Lung ca. (large 
cell)NCI-H460 


0.0 


93108 astrocytes TNFa (4 ng/ml) and 
ILlb (1 ng/ml) 


19.6 


Lung ca. (non-sm. 
cell) A549 


1.5 


92666JCQ-812 (Basophil)_resting 


0.0 


Lung ca. (non-s.cell) 
NCI-H23 


1.8 


92667_KU-812 
(Basophil)_PMA/ionoycin 


15.1 


Lung ca (non-s.cell) 
HOP-62 


0.8 


93579_CCD1 106 (Keratinocytes)_none 


0.0 


Lung ca. (non-s.cl) 
NCI-H522 


8.7 


93580_CCD1106 

(Keratinocytes) TNFa and IFNg ** 


4.7 


Lung ca. (squam.) 
SW 900 


1.8 


93791_Liver Cirrhosis 


47.6 


Lung ca. (squam.) 
NCI-H596 


6.7 


93792_Lupus Kidney 


25.0 


Mammary gland 


8.0 


93577_NCI-H292 


17.0 


Breast ca.* (pi. 
effusion) MCF-7 


0.0 


93358_NCI-H292_IL-4 


4.8 


Breast ca.* (pl.ef) 
MDA-MB-231 


0.0 


93360_NCI-H292_IL-9 


12.9 


Breast ca.* (pi. 
effusion^ T47D 


0.0 


93359_NCI-H292_IL-13 


0.0 


Breast ca. 
BT-549 


0.0 


93357_NCI-H292_IFN gamma 


4.4 1 


Breast ca. 
MDA-N 


0.0 


93777_HPAEC_- 


0.0 


Mvarv 
\J\aiy 




93778 TTPAFC TT-1 heta/TNA alnha 


69 


Ovarian ca. 
OVCAR-3 


0.0 


93254_Nonnal Human Lung 
Fibroblast none 


0.0 


Ovarian ca. 
OVCAR-4 


1.2 


93253_Normal Human Lung 
Fibroblast_TNFa (4 ng/ml) and IL-lb (1 
ng/ml) 


6.2 


Ovarian ca. 
OVCAR-5 


7.7 


93257_Nonnal Human Lung 
Fibroblast 1L-4 


6.9 


Ovarian ca. 
OVCAR-8 


0.0 


93256_Normal Human Lung 
Fibroblast IL-9 


0.0 


Ovarian ca. 
IGROV-1 


0.0 


93255_Nonnal Human Lung 
Fibroblast IL-13 


0.0 


Ovarian ca.* (ascites) 
SK-OV-3 


0.0 


93258_Normal Human Lung 
FibroblastlFN gamma 


0.0 


Uterus 


1.9 


93106_Dermal Fibroblasts 
CCD1070 resting 


12.9 
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Plancenta 


0.0 


93361_Dennal Fibroblasts 
CCD1070 TNF alpha 4 ng/ml 


28.5 


Prostate 


4.9 


93 105_Dermal Fibroblasts 
CCD1070 IL-1 beta 1 ng/ml 


0.0 


Prostate ca.* (bone 
met)PC-3 


4.4 


93772_deimal fibroblastJFN gamma 


9.8 


Testis 


57.0 


93771_dennal fibroblast_IL-4 


2.6 


Melanoma 
Hs688(A).T 


0.0 


93259_IBD Colitis 1** 


88.3 


Melanoma* (met) 
Hs688(B).T 


0.0 


93260JBD Colitis 2 


0.0 


Melanoma 
UACC-62 


0.0 


93261_IBD Crohns 


0.0 


Melanoma 
M14 


0.0 


73501 0 J3olon_normal 


20.2 


Melanoma 
LOXIMVI 


3.6 


735019JLung^none 


29.7 


Melanoma* (met) 
SK-MEL-5 


3.3 


64028-1 JThymus_none 


100.0 


Adipose 


3.2 


64030- l_Kidney_none 


10.0 



The gene MOLlOa shows high levels in the testis and in fetal lung in panel 1 .3D. This 
indicates that this gene may be used for regeneration therapy in the lung and may also play a role 
in male fertility. The profile in panel 4D shows high expression in thymus with low to 
5 undetectable expression in other tissues and cell lines (Ct values >35). Therefore this gene may 
be involved in T-cell development and may be a marker for immature T cells. 



EXAMPLE 2: SNP ANALYSIS OF MOL6A 

Variant sequences are included in this application. A variant sequence can include a 
10 single nucleotide polymorphism (SNP). A SNP can, in some instances, be referred to as a 

"cSNP" to denote that the nucleotide sequence containing the SNP originates as a cDNA. A SNP 
can arise in several ways. For example, a SNP may be due to a substitution of one nucleotide for 
another at the polymorphic site. Such a substitution can be either a transition or a transversion. A 
SNP can also arise from a deletion of a nucleotide or an insertion of a nucleotide, relative to a 
15 reference allele. In this case, the polymorphic site is a site at which one allele bears a gap with 
respect to a particular nucleotide in another allele. SNPs occurring within genes may result in an 
alteration of the amino acid encoded by the gene at the position of the SNP. Intragenic SNPs 
may also be silent, however, in the case that a codon including a SNP encodes the same amino 
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acid as a result of the redundancy of the genetic code. SNPs occurring outside the region of a 
gene, or in an intron within a gene, do not result in changes in any amino acid sequence of a 
protein but may result in altered regulation of the expression pattern for example, alteration in 
temporal expression, physiological response regulation, cell type expression regulation, intensity 
5 of expression, stability of transcribed message. 

Method of novel SNP Identification: SNPs are identified by analyzing sequence 
assemblies using CuraGen's proprietary SNPTool algorithm. SNPTool identifies variation in 
assemblies with the following criteria: SNPs are not analyzed within 10 base pairs on both ends 
of an alignment; Window size (number of bases in a view) is 10; The allowed number of 

10 mismatches in a window is 2; Minimum SNP base quality (PHRED score) is 23; Minimum 

number of changes to score an SNP is 2/assembly position. SNPTool analyzes the assembly and 
displays SNP positions, associated individual variant sequences in the assembly, the depth of the 
assembly at that given position, the putative assembly allele frequency, and the SNP sequence 
variation. Sequence traces are then selected and brought into view for manual validation. The 

1 5 consensus assembly sequence is imported into CuraTools along with variant sequence changes to 
identify potential amino acid changes resulting from the SNP sequence variation. 
Comprehensive SNP data analysis is then exported into the SNPCalling database. 

Method of novel SNP Confirmation: SNPs are confirmed employing a validated 
method know as Pyrosequencing (See Alderborn et al. Determination of Single Nucleotide 

20 Polymorphisms by Real-time Pyrophosphate DNA Sequencing. (2000). Genome Research. 10, 
Issue 8, August. 1249-1265). In brief, Pyrosequencing is a real time primer extension process of 
genotyping. This protocol takes double-stranded, biotinylated PCR products from genomic 
DNA samples and binds them to streptavidin beads. These beads are then denatured producing 
single stranded bound DNA. SNPs are characterized utilizing a technique based on an indirect 

25 bioluminometric assay of pyrophosphate (PPi) that is released from each dNTP upon DNA chain 
elongation. Following Klenow polymerase-mediated base incorporation, PPj is released and used 
as a substrate, together with adenosine 5'-phosphosulfate (APS), for ATP sulfurylase, which 
results in the formation of ATP. Subsequently, the ATP accomplishes the conversion of luciferin 
to its oxi-derivative by the action of luciferase. The ensuing light output becomes proportional to 

30 the number of added bases, up to about four bases. To allow processivity of the method dNTP 
excess is degraded by apyrase, which is also present in the starting reaction mixture, so that only 
dNTPs are added to the template during the sequencing. The process has been fully automated 
and adapted to a 96-well format, which allows rapid screening of large SNP panels. 
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The DNA and protein sequences for the novel single nucleotide polymorphic variants of 
the Trypsin -like gene of MOL6a (GM87760758_A) are reported in Table 37. Variants are 
reported individually but any combination of all or a select subset of variants are also included. 
In Table 37, the positions of the variant bases and the variant amino acid residues are underlined. 
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Table 37. 

A. Variant 13373750 of MOL6a nucleotide sequence (SEQ ID NO. 11). 

C toA at position 360. 

B. Nucleotide sequence of variant at position 360. 

5 1TAC CATGAAATATGT CTT CTATT TGGGTGTC CT CGCTGGGAGATTiTT L'l'iTGCTGACT CAT CTG T C CAGAAAGAAGAC C 

81CTGCTCCCTATTTGGTGTACCTCAAGTCTC^ 
1 6 1 C CAGCT CACTG CTATTTACCAAATCTX3AAAGTGATG CTGGGAAATTTCAAGAGC^ 

2 4 1AATTAAC C C CATTCAGATCGTC CG CTACTGGAACTA CAGTCATAGCGC C C CACAGGATGAC CTCATG CT CATCAAGCTGG 

3 2 lCTAAGCCTGCCATGCTCAATCCCAAAGTCCAGCCCC^ 
10 4 01CTCTCAGGTTTGGACTGGAGCCAAGAAAACAGTC 

4 8 lTGATCGAGAATG C CAAAAAAACAGAAC^GGAAAAAG CCACAGGAATT C CTTATGTGTGAAATTTGTGAAAGTATTCAGC 

5 61CGAATTTTTGGGGAGGTGGCCGTTGC?^ 
641AGGGGACGTCGGCATCTACACCAATGTTTA 
721A (SEQ ID NO. 106) 

15 

C. Protein Sequence of variant at position 119. 

1MKYVPYLGVLAGTFPPADSSVQKEDPAP YliVYLKSHFNPCVGVLI KPSWVIlAPAHCyLPNLKV^^^Gl^FKSRVRDG^BQTI 
8 1NP I Q IVR YWNYS HS APQDDLML I KIiAKP AMLNP KVQ P LTLATTNVRPGTV CLL S GLDWSQENS GRHPDLRQNLiEAP VM S D 
1 6 1REC£KNRTRKKPQEFIJ4CB I CES I QPNFWGGGR^ CIIiD (SEQ ID 

20 NO. 107) 



D. Effect of variant on amino acid residue 
ProtoThr 
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EQUIVALENTS 

Although particular embodiments have been disclosed herein in detail, this has been done 
by way of example for purposes of illustration only, and is not intended to be limiting with 
respect to the scope of the appended claims, which follow. In particular, it is contemplated by 

30 the inventors that various substitutions, alterations, and modifications may be made to the 

invention without departing from the spirit and scope of the invention as defined by the claims. 
The choice of nucleic acid starting material, clone of interest, or library type is believed to be a 
matter of routine for a person of ordinary skill in the art with knowledge of the embodiments 
described herein. Other aspects, advantages, and modifications considered to be within the scope 

35 of the following claims. 
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WHAT IS CLAIMED IS: 

1 . An isolated polypeptide comprising an amino acid sequence selected from the group 
consisting of: 

(a) a mature form of an amino acid sequence selected from the group consisting of 
SEQ ID NOS:2, 4, 6, 8, 10, 12, 14, 16, 18, 20, 22, 24, 26, 28, and 30; 

(b) a variant of a mature form of an amino acid sequence selected from the group 
consisting of SEQ ID NOS:2, 4, 6, 8, 10, 12, 14, 16, 18, 20, 22, 24, 26, 28, and 
30, wherein one or more amino acid residues in said variant differs from the 
amino acid sequence of said mature form, provided that said variant differs in no 
more than 15% of the amino acid residues from the amino acid sequence of said 
mature form; 

(c) an amino acid sequence selected from the group consisting of SEQ ID NOS:2, 4, 
6, 8, 10, 12, 14, 16, 18, 20, 22, 24, 26, 28, and 30; and 

(d) a variant of an amino acid sequence selected from the group consisting of SEQ ID 
NOS:2, 4, 6, 8, 10, 12, 14, 16, 18, 20, 22, 24, 26, 28, and 30, wherein one or more 
amino acid residues in said variant differs from the amino acid sequence of said 
mature form, provided that said variant differs in no more than 15% of amino acid 
residues from said amino acid sequence. 

2 The polypeptide of claim 1, wherein said polypeptide comprises the amino acid sequence 
of a nataally<>ccurring allelic variant of an amino acid sequence selected from the group 
consisting of SEQ ID NOS:2, 4, 6, 8, 10, 12, 14, 16, 1 8, 20, 22, 24, 26, 28, and 30. 

3. The polypeptide of claim 2, wherein said allelic variant comprises an amino acid 
sequence that is the translation of a nucleic acid sequence differing by a single nucleotide 
from a nucleic acid sequence selected from the group consisting of SEQ ID NOS: 1, 3, 5, 
7, 9, 11, 13, 15, 17, 19, 21, 23, 25, 27, and 29. 

4. The polypeptide of claim 1 , wherein the amino acid sequence of said variant comprises a 
conservative amino acid substitution. 
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5. An isolated nucleic acid molecule comprising a nucleic acid sequence encoding a 
polypeptide comprising an amino acid sequence selected from the group consisting of: 

(a) a mature form of an amino acid sequence selected from the group consisting of 
SEQ ID NOS:2, 4, 6, 8, 10, 12, 14, 16, 18, 20, 22, 24, 26, 28, and 30; 

(b) a variant of a mature form of an amino acid sequence selected from the group 
consisting of SEQ ID NOS:2, 4, 6, 8, 10, 12, 14, 16, 18, 20, 22, 24, 26, 28, and 
30, wherein one or more amino acid residues in said variant differs from the 
amino acid sequence of said mature form, provided that said variant differs in no 
more than 15% of die amino acid residues from the amino acid sequence of said 
mature form; 

(c) an amino acid sequence selected from the group consisting of SEQ ID NOS:2, 4, 
6, 8, 10, 12, 14, 16, 18, 20, 22, 24, 26, 28, and 30; 

(d) a variant of an amino acid sequence selected from the group consisting SEQ ID 
NOS:2, 4, 6, 8, 10, 12, 14, 16, 18, 20, 22, 24, 26, 28, and 30, wherein one or more 
amino acid residues in said variant differs from the amino acid sequence of said 
mature form, provided that said variant differs in no more than 15% of amino acid 
residues from said amino acid sequence; 

(e) a nucleic acid fragment encoding at least a portion of a polypeptide comprising an 
amino acid sequence chosen from the group consisting of SEQ ED NOS:2, 4, 6, 8, 
10, 12, 14, 16, 18, 20, 22, 24, 26, 28, and 30, or a variant of said polypeptide, 
wherein one or more amino acid residues in said variant differs from the amino 
acid sequence of said mature form, provided that said variant differs in no more 
than 15% of amino acid residues from said amino acid sequence; and 

(f) a nucleic acid molecule comprising the complement of (a), (b), (c), (d) or (e). 

6. The nucleic acid molecule of claim 5, wherein the nucleic acid molecule comprises the 
nucleotide sequence of a naturally-occurring allelic nucleic acid variant 

7. The nucleic acid molecule of claim 5, wherein the nucleic acid molecule encodes a 
polypeptide comprising the amino acid sequence of a naturally<>ccurring polypeptide 
variant. 
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8. The nucleic acid molecule of claim 5, wherein the nucleic acid molecule differs by a 
single nucleotide from a nucleic acid sequence selected from the group consisting of SEQ 
ID NOS:l, 3, 5, 7, 9, 11, 13, 15, 17, 19, 21, 23, 25, 27, and 29. 

9. The nucleic acid molecule of claim 5, wherein said nucleic acid molecule comprises a 
nucleotide sequence selected from the group consisting of: 

(a) a nucleotide sequence selected from the group consisting of SEQ ID NOS: 1 , 3, 5, 
7, 9, 1 1, 13, 15, 17, 19, 21, 23, 25, 27, and 29; 

(b) a nucleotide sequence differing by one or more nucleotides from a nucleotide 
sequence selected from the group consisting of SEQ ID NOS:l, 3, 5, 7, 9, 1 1, 13, 
15, 17, 19, 21, 23, 25, 27, and 29, provided that no more than 20% of the 
nucleotides differ from said nucleotide sequence; 

(c) a nucleic acid fragment of (a); and 

(d) a nucleic acid fragment of (b). 

1 0. The nucleic acid molecule of claim 5, wherein said nucleic acid molecule hybridizes 
under stringent conditions to a nucleotide sequence chosen from the group consisting of 
SEQ ID NOS:l 3 3, 5, 7, 9, 1 1, 13, 15, 17, 19, 21, 23, 25, 27, and 29, or a complement of 
said nucleotide sequence. 

1 1 . The nucleic acid molecule of claim 5, wherein the nucleic acid molecule comprises a 
nucleotide sequence selected from the group consisting of: 

(a) a first nucleotide sequence comprising a coding sequence differing by one or 
more nucleotide sequences from a coding sequence encoding said amino acid 
sequence, provided that no more than 20% of the nucleotides in the coding 
sequence in said first nucleotide sequence differ from said coding sequence; 

(b) an isolated second polynucleotide that is a complement of the first polynucleotide; 
and 

(c) a nucleic acid fragment of (a) or (b). 

12. A vector comprising the nucleic acid molecule of claim 1 1 . 

13. The vector of claim 12, further comprising a promoter operably-linked to said nucleic 
acid molecule. 
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1 4. A cell comprising the vector of claim 1 2. 

15. An antibody that binds immunospecifically to the polypeptide of claim 1 . 

16. The antibody of claim 15, wherein said antibody is a monoclonal antibody. 

1 7. The antibody of claim 1 5, wherein the antibody is a humanized antibody. 

1 8. A method for determining the presence or amount of the polypeptide of claim 1 in a 
sample, the method comprising: 

(a) providing the sample; 

(b) contacting the sample with an antibody that binds immunospecifically to the 
polypeptide; and 

(c) determining the presence or amount of antibody bound to said polypeptide, 
thereby determining the presence or amount of polypeptide in said sample. 

19. A method for determining the presence or amount of the nucleic acid molecule of claim 5 
in a sample, the method comprising: 

(a) providing the sample; 

(b) contacting the sample with a probe that binds to said nucleic acid molecule; and 

(c) determining the presence or amount of die probe bound to said nucleic acid 
molecule, 

thereby determining the presence or amount of the nucleic acid molecule in said sample. 

20. The method of claim 1 9 wherein presence or amount of the nucleic acid molecule is used 
as a marker for cell or tissue type. 

2 1 . The method of claim 20 wherein the cell or tissue type is cancerous. 

22. A method of identifying an agent that binds to a polypeptide of claim 1, the method 
comprising: 

(a) contacting said polypeptide with said agent; and 

(b) determining whether said agent binds to said polypeptide. 
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23. The method of claim 22 wherein the agent is a cellular receptor or a downstream effector. 

24. A method for identifying an agent that modulates the expression or activity of the 
polypeptide of claim 1, the method comprising: 

(a) providing a cell expressing said polypeptide; 

(b) contacting the cell with said agent, and 

(c) determining whether the agent modulates expression or activity of said 
polypeptide, 

whereby an alteration in expression or activity of said peptide indicates said agent modulates 
expression or activity of said polypeptide. 

25. A method for modulating the activity of the polypeptide of claim 1, the method 
comprising contacting a cell sample expressing the polypeptide of said claim with a 
compound that binds to said polypeptide in an amount sufficient to modulate the activity 
of the polypeptide. 

26. A method of treating or preventing a MOLX-associated disorder, said method comprising 
administering to a subject in which such treatment or prevention is desired the 
polypeptide of claim 1 in an amount sufficient to treat or prevent said MOLX-associated 
disorder in said subject. 

27. The method of claim 26 wherein the disorder is selected from the group consisting of 
cardiomyopathy and atherosclerosis. 

28. The method of claim 26 wherein the disorder is related to cell signal processing and 
metabolic pathway modulation. 

29. The method of claim 26, wherein said subject is a human. 

30. A method of treating or preventing a MOLX-associated disorder, said method comprising 
administering to a subject in which such treatment or prevention is desired the nucleic 
acid of claim 5 in an amount sufficient to treat or prevent said MOLX-associated disorder 
in said subject 
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3 1 . The method of claim 30 wherein the disorder is selected from the group consisting of 
cardiomyopathy and atherosclerosis. 

32. The method of claim 30 wherein the disorder is related to cell signal processing and 
metabolic pathway modulation. 

33. The method of claim 30, wherein said subject is a human. 

34. A method of treating or preventing a MOLX-associated disorder, said method comprising 
administering to a subject in which such treatment or prevention is desired the antibody 
of claim 1 5 in an amount sufficient to treat or prevent said MOLX-associated disorder in 
said subject 

35. The method of claim 34 wherein the disorder is diabetes. 

36. The method of claim 34 wherein the disorder is related to cell signal processing and 
metabolic pathway modulation. 

37. The method of claim 34, wherein the subject is a human. 

38. A pharmaceutical composition comprising the polypeptide of claim 1 and a 
pharmaceutically-acceptable carrier. 

39. A pharmaceutical composition comprising the nucleic acid molecule of claim 5 and a 
pharmaceutically-acceptable carrier. 

40. A pharmaceutical composition comprising the antibody of claim 15 and a 
pharmaceutically-acceptable carrier. 

41 . A kit comprising in one or more containers, the pharmaceutical composition of claim 3 8. 

42. A kit comprising in one or more containers, the pharmaceutical composition of claim 39. 
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43. A kit comprising in one or more containers, the pharmaceutical composition of claim 40. 

44. A method for determining the presence of or predisposition to a disease associated with 
altered levels of the polypeptide of claim 1 in a first mammalian subject, the method 
comprising: 

(a) measuring the level of expression of the polypeptide in a sample from the first 
mammalian subject; and 

(b) comparing the amount of said polypeptide in the sample of step (a) to the amount 
of the polypeptide present in a control sample from a second mammalian subject 
known not to have, or not to be predisposed to, said disease; 

wherein an alteration in the expression level of the polypeptide in the first subject as compared to 
the control sample indicates the presence of or predisposition to said disease. 

45. The method of claim 44 wherein the predisposition is to cancers. 

46. A method for determining the presence of or predisposition to a disease associated with 
altered levels of the nucleic acid molecule of claim 5 in a first mammalian subject, the 
method comprising: 

(a) measuring the amount of the nucleic acid in a sample from the first mammalian 
subject; and 

(b) comparing the amount of said nucleic acid in the sample of step (a) to the amount 
of the nucleic acid present in a control sample from a second mammalian subject 
known not to have or not be predisposed to, the disease; 

wherein an alteration in the level of the nucleic acid in the first subject as compared to the 
control sample indicates the presence of or predisposition to the disease. 

47. The method of claim 46 wherein the predisposition is to a cancer. 

48. A method of treating a pathological state in a mammal, the method comprising 
administering to the mammal a polypeptide in an amount that is sufficient to alleviate the 
pathological state, wherein the polypeptide is a polypeptide having an amino acid 
sequence at least 95% identical to a polypeptide comprising an amino acid sequence of at 
least one of SEQ ID NOS:2, 4, 6, 8, 10, 12, 14, 16, 1 8, 20, 22, 24, 26, 28, and 30, or a 
biologically active fragment thereof. 
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49. A method of treating a pathological state in a mammal, the method comprising 
administering to the mammal the antibody of claim 1 5 in an amount sufficient to alleviate 
the pathological state. 

50. A method for the screening of a candidate substance interacting with an olfactory 
receptor polypeptide selected from the group consisting of SEQ ID NOS:2, 4, 6, 8, 10, 
12, 14, 16, 1 8, 20, 22, 24, 26, 28, and 30, or fragments or variants thereof, comprises the 
following steps: 

a) providing a polypeptide selected from the group consisting of the 
sequences of SEQ ID NOS:2, 4, 6, 8, 10, 12, 14, 16, 18, 20, 22, 24, 26, 28, and 
30, or a peptide fragment or a variant thereof, 

b) obtaining a candidate substance; 

c) bringing into contact said polypeptide with said candidate substance; and 

d) detecting the complexes formed between said polypeptide and said 
candidate substance. 

51. A method for the screening of ligand molecules interacting with an olfactory receptor 
polypeptide selected from the group consisting of SEQ ID NOS:2, 4, 6, 8, 10, 12, 14, 16, 
18, 20, 22, 24, 26, 28, and 30, wherein said method comprises: 

a) providing a recombinant eukaryotic host cell containing a nucleic acid 
encoding a polypeptide selected from the group consisting of the polypeptides 
comprising the amino acid sequences SEQ ED NOS:2, 4, 6, 8, 10, 12, 14, 16, 18, 
20, 22, 24, 26, 28, and 30; 

b) preparing membrane extracts of said recombinant eukaryotic host cell; 

c) bringing into contact the membrane extracts prepared at step b) with a 
selected ligand molecule; and 

d) detecting the production level of second messengers metabolites. 

52. A method for the screening of ligand molecules interacting with an olfactory receptor 
polypeptide selected from the group consisting of SEQ ID NOS:2, 4, 6, 8, 10, 12, 14, 16, 
18, 20, 22, 24, 26, 28, and 30, wherein said method comprises: 
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a) providing an adenovirus containing a nucleic acid encoding a polypeptide 
selected from the group consisting of polypeptides comprising the amino acid 
sequences SEQ ID NOS:2, 4, 6, 8, 10, 12, 14, 16, 18, 20, 22, 24, 26, 28, and 30; 

b) infecting an olfactory epithelium with said adenovirus; 

c) bringing into contact the olfactory epithelium b) with a selected ligand 
molecule; and 

d) detecting the increase of the response to said ligand molecule. 



225 



